Site Reliability Engineer

San Francisco, CA, United States


Role Location

  • San Francisco, CA, United States


101 - 250 people


799 Market St Fl 8
San Francisco, CA, 94103, US

Tech Stack

  • NodeJS
  • Swift
  • Android/Java
  • WebSockets
  • React
  • Node.js
  • MySQL

Role Description

Airtable is a mission-critical system for a diverse set of teams and industries. We build our platform to scale, stay resilient, and deliver delightful user experience all around. Our infrastructure requires thorough thinking, deep research into how things work, and rigorous coding. Our mission is ambitious, but we like to keep our infrastructure simple.

As one of the first dedicated site reliability engineers at Airtable, you will play a critical role in scaling and refining our operational practices. Site reliability engineering begins with building solid automation across the software delivery process, including configuration, provisioning, testing, deployment, and beyond. SREs will also work with software engineers to help understand the way their code behaves in production, and build nontrivial internal tooling to enable this. We also strive for a strong security posture, and SRE will help define and implement operational practices that protect our users. Lastly, of course, our operations team is the last line of defense when incidents happen, and SREs will be part of the team that responds to them.

What you'll do:

  • Automate everything: deploys, rollbacks, database provisioning, failovers, and everything in between.
  • Design and implement monitoring tooling across the stack, and optimize systems for uptime, performance, and reliability based on the data gathered by this tooling.
  • Design and write tests that investigate how our infrastructure handles failure and scaling.
  • Research hot-off-the-press CVEs and implement best practices.
  • Build occasional product features as appropriate.
  • Write solid, maintainable code (including a lot of JavaScript) for all of the above.
  • Manage our Elasticsearch cluster

Who you are:

  • You're painfully thorough, whether it's scripting bulletproof deployment automation, writing a recovery playbook that an engineer can follow without fail at 3 a.m., or digging into logs and monitoring data to find the root of a problem.
  • You're OK carrying a pager and take it seriously, but you take pride when the pager hasn't rung in the past week.
  • You've worked with Linux, containers/namespaces, and system automation tools for Unix and cloud platforms.
  • You have 5+ years of relevant technical experience, including significant experience with site reliability/devops or server infrastructure engineering.

About Airtable

Airtable's mission is to democratize software creation. Our product allows anyone to create with a platform that anyone can use to build their own tools and workflows. See our Airtable Universe gallery for examples of what people are doing with Airtable ( ).

  • We're sweeping across certain industries: for instance, in the media vertical, our paid customers include Buzzfeed, Condé Nast, Group Nine, HBO, Starz, Spotify, Netflix, Hearst, TIme, NYTimes, and CBS. Other innovative companies like WeWork, Clover Health, and Zapier are powered by Airtable nearly wall to wall, and we're a major part of their ability to innovate at scale.

  • People don't just use Airtable, they love it ( ).

  • We recently raised a series C, bringing our total amount raised to $170M in funding from notable investors including: Benchmark, Thrive, CRV, Caffeinated Capital, Freestyle Capital, Data Collective, Founder Collective, and CrunchFund.

  • Our team brings experience from building and scaling products touching millions of people at Google, Facebook, Dropbox, Microsoft and other leading technology companies. More about us:

  • Our company is built upon a philosophical commitment to providing the best possible experience for the end user, even if it requires going to great lengths in engineering and design effort. We plan on building this company for decades to come, and while we love rapid iteration where it makes sense, we also take the time to make farsighted architectural decisions and place great emphasis on code quality over raw speed.

Company Culture

We love polymathic people with open-mindedness and intellectual curiosity.

Interested in this role?
Skip straight to final-round interviews by applying through Triplebyte.