Staff SWE, Data Infrastructure

New York, NY, United States, Remote


Role Locations

  • New York, NY, United States
  • Remote


501+ people


116 W 23rd St
New York City, NY, 10011, US

Tech Stack

  • Python
  • PostgreSQL
  • Django
  • TypeScript
  • React

Role Description

We’re looking for an experienced Data Infrastructure Engineer who is motivated to strive for technical excellence, working on unique and high impact problems to empower teams across Ro’s business.

You're an engineer with great communication skills and a proven track record of productionizing Data Infrastructure systems. At Ro, you'll collaborate with technical and nontechnical stakeholders and fellow engineers from across the business to identify meaningful problems for our patients and our business, as well as, owning and leading the design and delivery of reliable, resilient, and scalable systems to solve those problems. You'll establish and promote best practices for working with data across the organization as we continue to scale.

What You'll Do:

Build, automate, and manage the life-cycle of the systems and platforms that collect, process, and surface our data (production side and analytics side)

Work with Data Scientists and Data Analysts to create workflows for developing and deploying production-ready machine learning systems

Own new initiatives by collaborating with other infrastructure engineers, software engineers, data analysts, ML experts, and stakeholders throughout the company

Be an excellent systems engineer, with reliability and scalability always top of mind

Work with stakeholders in Data and on other teams to assist with data-related technical issues and support their data infrastructure needs

Keep our data secure, including via appropriate access control mechanisms, a stringent secure-by-design mindset, and following industry best practices or forging new ones where needed

Provide visibility into the health of our data infrastructure

Manage the Data team’s infrastructure, including databases, computing resources, and orchestration

Build large-scale batch and real-time data pipelines with data processing frameworks (ex: Scio, Storm, Spark, EMR, etc)

Help drive optimization, testing, and tooling to improve data quality

Implement ETL processes and data pipelines to load data from a variety of first- and third-party applications

What You'll Bring:

Experience building, deploying, monitoring, and optimizing production data collection systems at scale

Strong analytical intuition; experience working with data analytics groups

A track record of project ownership for projects that involve many contributors and stakeholders, technical and non-technical alike

A track record of implementing and also developing industry best practices

Excellent communication skills, from written/verbal to documentation to presentations

Experience building libraries and tooling that provide beautiful abstractions to users

A track record of promoting best practices by example and of mentoring other engineers

About Ro


Company Culture

Mission drive and Open - everything we do is with Patient in mind and aimed to accelerate our Mission of increasing access to more affordable, high quality healthcare to substantially increase patient outcomes and improve lives

Interested in this role?
Skip straight to final-round interviews by applying through Triplebyte.