Senior Site Reliability Engineer (Remote)

Los Angeles, CA, United States, Remote


Role Locations

  • Los Angeles, CA, United States
  • Remote


251 - 500 people


2030 East Maple Avenue
El Segundo, CA, 90245, US

Tech Stack

  • Go
  • AWS
  • Node.js
  • JavaScript
  • React
  • MySQL
  • Docker
  • HTML
  • css
  • Terraform
  • Ansible
  • Express.js
  • TypeScript
  • Angular
  • Google Cloud Platform

Role Description

GoGuardian is on a mission to transform education by helping to protect students in the digital space from harmful and distracting content and supporting their mental health. We partner with schools to identify learning patterns and maximize the academic potential of every student. With GoGuardian, educators can engage students with more effective resources while also promoting online accessibility in the most applicable way for their school population.

While this job posting uses El Segundo, CA as the location, please note this role is currently being recruited for remote work or reporting to headquarters if within commuting distance.

As a Senior Site Reliability Engineer - Pear Deck, you will be helping us focus our expectations around availability, correctness, and performance while building tools and sharing expertise with the team to ensure our service continues to meet expectations as it scales. The work will cover a wide area, from directly improving our core services to on-call and incident analysis, education around scaling and resilience, and feedback into the product itself.

What You'll Do

Implement and provision necessary infrastructure changes for continued and/or improved site reliability Plan and Implement changes to reduce toil Read, understand, and review application code to support software development efforts from a reliability / infrastructure perspective Monitor health of production infrastructure and investigate/analyse any issues and abnormalities to identify problems or bottlenecks Communicate uptime and quality of service issues effectively Demand Forecasting and Capacity planning for continued and/or improved site reliability On call rotations and incident response during off-hours Implement and deploy hotfixes as necessary Plan, track and perform routine system maintenance and software updates to infrastructure Track and document reliability related issues and incidents Map business goals to architectural/infrastructure decisions

Who You Are

5+ years and/or 3+ previous industry position(s) w/ dedicated experience as an operations engineer, devops engineer or SRE supporting SaaS applications in large-scale cloud environments Direct involvement in shipping multiple production SaaS applications / products in varied disciplines or verticals Proficiency with the following technologies and practices: Google Cloud Platform, Kubernetes (CKA or similar certification preferred), Docker, Terraform On the job exposure or experience with the following technologies: Python, Prometheus, MongoDB, Redis, Firebase Realtime Database, BigQuery Writes production-grade code for well-scoped features; integrates feedback from code reviewers Learns quickly, applies existing knowledge to new challenges and is building mastery in relevant technical skills Confident in making technical decisions and explaining the reasoning behind them Comfortable developing solid technical solutions to ambiguous or open-ended problems Driven to teach, lead and help others in areas of strongest skill and experience Has software development experience and/or understanding of programming languages, data structures and algorithms

What We Offer?

A varied and challenging role in a multinational and highly innovative company A robust benefits package including health insurance, 401(k) retirement savings plan with company match, employee stock option plan, paid parental leave, 13 paid company holidays, and much more Development and further training opportunities for shaping and realizing your career goals Exceptional colleagues with a passion for EdTech

About GoGuardian

GoGuardian is on a mission to supercharge human potential by creating the ultimate learning platform. We help thousands of K-12 schools and districts maximize the learning potential of every student by enabling more productive, effective, and safer digital learning.

Company Culture

Our Guiding Principles describe how we work, what we value, and our decision-making philosophy. Our team is truly passionate about creating solutions that take digital learning to new heights, and that passion is at the heart of everything we do. Yet we also know it’s not just what we do but how we do it, and we’re proud to have built a purpose-driven culture of collaboration, openness, trust, and transparency.

Interested in this role?
Skip straight to final-round interviews by applying through Triplebyte.