Site Reliability Engineer

San Francisco, CA, United States

Alliance of American Football


Employees

251 - 500 people

Address

149 New Montgomery St Fl 4
San Francisco, CA, 94105, US

Tech Stack

  • iOS Development
  • Swift
  • Android
  • Kotlin
  • AWS
  • Go
  • C++
  • TypeScript
  • GraphQL

Role Description

As a Site Reliability Engineer, you will be responsible for deploying, monitoring, and scaling our real-time video platform. You will plan for our game-day spikes and ensure that we keep latency minimal.

Responsibilities:

  • Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.

Qualifications:

  • Experience with algorithms, data structures, complexity analysis and software design.
  • Experience in one or more of the following: C++, Python, or Go.
  • Interest in designing, analyzing and troubleshooting large-scale distributed systems.
  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
  • Ability to debug and optimize code and automate routine tasks.

Bonus Points: AWS certification as a Solutions Architect
