Senior Spark Engineer

San Francisco, CA, United States




Role Location

  • San Francisco, CA, United States


501+ people


225 Bush St Ste 1700
San Francisco, CA, 94104, US

Tech Stack

  • Java
  • MySQL
  • Go
  • Kubernetes
  • Hadoop
  • Apache Spark
  • Kafka
  • React
  • Ruby on Rails
  • Google Cloud

Role Description

LiveRamp is looking for engineers to help us maintain and scale our Big Data infrastructure. Our data infrastructure team operates one of the largest Hadoop deployments in the world, with 90,000 CPUs, 300TB of memory, and 30 PB of storage. We run 100,000 YARN applications and deliver billions of customer records per day to partners.

Running this many data applications takes a heavy support stack. ZooKeeper coordinates data deliveries. HashiCorp Consul exposes services. Vault stores secrets. We use open-source tools when they make sense, but we aren’t afraid to get our hands dirty when they don’t. We work at a scale that only a few companies in the world match, and our most interesting tools come from solving those challenges.

The most valuable part of our infrastructure is the 60+ engineers writing hundreds of big data applications on it, across a dozen application teams. These engineers are our customers, and our job is to work closely with them, helping them build efficient applications through both education and the tools they need to get the job done.

You will:

  • Build big data applications and tooling to assist 60+ fellow developers in writing hundreds of big data applications on our infrastructure.
  • Ensure operational excellence and empower the organization to achieve a high level of technical productivity, reliability, and simplicity.
  • Act as technical point of contact for large re-architectures that touch all LiveRamp systems.
  • Foster a positive environment of integrity, empowerment, initiative, and teamwork.
  • Be responsible for developing and managing our Spark environment.
  • Be responsible for developing tools and infrastructure for data processing: a set of libraries and/or scripts used by product engineers, e.g. monitoring frameworks, integrations with various types of storage, regression/sandbox testing, format conversions, and tools for debugging Spark.
  • Provide guidance to, and collaborate with, product engineering teams in the context of their big data pipelines.
  • Work closely with your customers (other developers) and iterate on their feedback. You err on the side of shipping fast and iterating.

Your team will:

  • Maintain and scale our Big Data infrastructure.
  • Engineer scalable data processing engines of stunning complexity.
  • Build tools to empower LiveRamp’s developers to operate effectively and efficiently.
  • Deliver on the technical roadmap for LiveRamp’s core product offerings.

About you:

  • You have 5+ years of experience in big data engineering.
  • You have strong experience programming in Spark, Scala/Java, or Python.
  • You are familiar with setting up, fine-tuning, and managing Spark environments.
  • You have a solid track record of operating in massive-scale data environments.
  • You have a solid understanding of the big data ecosystem: current trends and the pros and cons of specific technologies (e.g. RDBMS vs NoSQL databases, in-memory processing, disk I/O).
  • You keep up to date with cutting-edge big data frameworks; you enjoy digging into the tradeoffs of Spark vs Flink vs MapReduce for a new project.
  • You are both productive and pragmatic: software is only useful if it is used.
  • You have world-class debugging skills.
  • You get excited when you make a Hadoop application more efficient, but you get even more excited when you help developers be more efficient.
  • You value fast and honest feedback while maintaining a lighthearted attitude.

Bonus Points:

  • Familiarity with streaming frameworks, e.g. Flink, Spark Streaming, Kafka
  • Past or active collaboration with the Apache open source community
  • Experience in managing infrastructure cost tradeoffs
  • Familiarity with maintaining Hadoop clusters
  • Experience with hosted Hadoop solutions (Dataproc)

Benefits:

  • People. Work with talented, collaborative, and friendly people who love what they do.
  • Food. Enjoy catered meals, boundless snacks, and the occasional food truck.
  • Fun. We host events such as game nights, happy hours, camping trips, and sports leagues.
  • Stock. Every employee is a stakeholder in our future.
  • Health and Saving. Receive the benefits of comprehensive health, dental, vision, and disability insurance along with a 401k matching plan.
  • Location. Work in the heart of San Francisco and take advantage of our commuter benefits.

More about us:

LiveRamp is the trusted platform that makes data accessible and meaningful. Our services power people-based customer experiences that improve the relevance of marketing and allow consumers to better connect with the brands and products they love. We thrive on solving the toughest technical and customer challenges, and we’re always looking for smart, compassionate people to help us blaze a trail.

We value strong engineers who are agile enough to hit the ground running and tackle challenges.

To all recruitment agencies: LiveRamp does not accept agency resumes. Please do not forward resumes to our jobs alias, LiveRamp employees or any other company location. LiveRamp is not responsible for any fees related to unsolicited resumes.

About LiveRamp

LiveRamp's product is a massive identity graph connecting each individual to their online identifiers like cookies and device IDs.

Our engineering workflows can be broken into three parts: Data Ingestion, Data Manipulation, and Data Distribution. We ingest behavioral data indexed by offline and online identifiers from our clients, resolve every index to a single LiveRamp ID through our Identity Graph, and then distribute ingested data to other technology platforms in the industry.

Company Culture

All LiveRampers are smart, nice, and get things done. We move quickly and value autonomy and project ownership over all else. We empower our people to use their best judgment when making decisions. We grow our talent by challenging them with stretch projects and supporting them with mentorship.
