Big data with containers


COMPANY SIZE Less than 5

# OF ENGINEERS Less than 5


TAGS Analytics, Big Data, Developer APIs, Developer Tools, Distributed Systems / Scaling, YC Winter 2015


What do we do?

Pachyderm is a Data Lake. A place to dump and process gigantic data sets. Pachyderm is inspired by the Hadoop ecosystem but shares no code with it. Instead we leverage the container ecosystem to provide the broad functionality of Hadoop with the ease of use of Docker.

Pachyderm core features:
- Virtually limitless storage for any data.
- Virtually limitless processing power using any tools.
- Tracking of data history, provenance and ownership. (Version Control).
- Automatic processing on new data as it’s ingested. (Streaming).
- Chaining processes together. (Pipelining)

Why join us?

What would data analytics infrastructure (namely Hadoop) look like if we rebuilt it from scratch today? We think it would be containerized, modular, and easy enough for a single person to use while still being scalable enough for a whole company. Tools like Docker and Kubernetes provide the perfect building blocks for us revolutionize data infrastructure!

Pachyderm's is looking for our first hire! We went through YC W15, raised a strong seed round($2M), and are looking for someone to help lead our core engineering team. Pachyderm is just founders right now, so you'd be getting in right at the ground floor and have an enormous impact on the success and direction of the company as well as building the rest of the engineering team.

We pay competitive SF-level salaries along with significant equity, full benefits, and all the usual startup perks. This position is based in SF, but we offer full relocation assistance.

Read more about our long-term company vision:

Our press coverage

Our Founders

Joey Zwicker


Our tech stack

  • Golang
  • Kubernetes
  • Docker
  • Distributed Systems

Our investors

  • Data Collective
  • Susa Ventures
  • Foundation Capital
  • Blumberg Capital