Site Reliability Engineer (SRE) Manager/Lead

San Francisco, CA, United States

Atlassian (Statuspage Team)


Role Location

  • San Francisco, CA, United States

Employees

501+ people

Address

1098 Harrison St
San Francisco, CA, 94103, US

Tech Stack

  • Ruby on Rails
  • AWS
  • Varnish
  • Redis
  • Elastic Search
  • VTM
  • Memcached
  • Terraform
  • React
  • JavaScript

Role Description

As a SRE Sr. Lead/Manager at Atlassian, you will join an engineering-led company and the award-winning leader in software development and collaboration tools with over 85,000 Customers.

Statuspage are seeking to add a seasoned SRE leader with experience scaling services that have been demanding in terms of real-world traffic & throughput to their growing team. Statuspage is at the forefront of reshaping incident management & helping customers turn downtime into a great customer experience. This group is scaling rapidly & can offer an open runway for the right person.

Once onboard, you'll technically grow & lead a team taking ops projects from concept through to launch. You will play in integral role in designing and developing operational improvements that will help us keep stellar uptime and reliability so the Internet can stay honest around uptime.

Leveraging your understanding of architecture & ability to get into the ebb & flow of the code, you’ll strive to harden our platform so we can grow by x100. You'll be involved in evolving and implementing our disaster recovery solution. You are steadfast, and you're unflappable when on call. You're a builder at heart and, ultimately, you will help define what it means to provide status updates for the Internet.

To be successful in this role; you’ll bring:

  • An extensive background in developing and operating large-scale cloud-based distributed applications
  • Direct experience developing/running Rails on AWS and are well-versed in standard methodologies and patterns
  • Laser focus and be able to design infrastructure solutions for scalability, reliability, high availability, performance, security, software maintainability, and operational excellence
  • The ability to fix the plane while in flight (not just support greenfield solutions)
  • The ability to prioritize existing technical and infrastructure debt, and experience to build and execute a plan to pay it off
  • Proven expertise working with others to drive alignment between architecture projects and product roadmap

In return, you will have the opportunity to:

  • Ask tough questions and develop your pragmatic approach to decision-making
  • Demonstrate your deep understanding of technologies like Elasticsearch, Redis, Postgres, Ruby and Rails.
  • Continue to grow your skills in running secure and scalable applications for highly available, multi-region, AWS deployments
  • Teach and influence engineers and other teammates
  • Build incredible software on a highly-effective TEAM
  • Ship code several times per day
  • This is both a people management & tech management position where you will lead a team of experienced, smart engineers based in downtown SF but form part of a global team.

More about our benefits

Our offices are open, highly collaborative and yes, fun! To support you at work (and play) we offer some fantastic perks: ample time off to relax and recharge, five paid volunteer days a year for your favorite cause, plenty of food and beverages, ergonomic workstations with sit/stand desks, unique ShipIt days, a company paid trip after five years, generous employer-paid insurance coverage (medical, dental, and vision) for you and your family, 401k matching and more.

More about Atlassian

Software is changing the world, and we’re at the center of it all. With a customer list that reads like a who's who in tech and a highly disruptive business model, we’re advancing the art of team collaboration with products like Jira, Confluence, Bitbucket, Trello, and now Stride. Driven by honest values, an amazing culture, and consistent revenue growth, we’re out to unleash the potential of every team. From Amsterdam and Austin to Sydney and San Francisco, we’re looking for people who are powered by passion and eager to do the best work of their lives in a highly autonomous yet collaborative, no B.S. environment.

About Atlassian (Statuspage Team)

Software is changing the world, and we’re at the center of it all. With a customer list that reads like a who's who in tech, and a highly disruptive business model, we’re advancing the art of team collaboration with products like Jira, Confluence, Bitbucket, Trello, Statuspage and now Stride.

Driven by honest values, an amazing culture, and consistent revenue growth, we’re out to unleash the potential of every team. From Amsterdam and Austin to Sydney and San Francisco, we’re looking for people who are powered by passion and eager to do the best work of their lives in a highly autonomous yet collaborative, no B.S. environment.

We are currently hiring for our Statuspage product.

Statuspage (https://www.statuspage.io/) is at the forefront of reshaping incident management & helping customers turn downtime into a great customer experience. We withstood the S3 and DYN outages and are building to withstand the next great Internet event. We’ll stay up when our customers are down!

Company Culture

It's our mission to unleash the potential in every team, and we know that teams perform best when they are diverse and every team member feels that they belong. It's the unique contributions of all Atlassians that drive our success, and we're committed to building a culture where everyone can thrive and find meaning in their work. Check out our core values here: https://www.atlassian.com/company/values

Interested in this role?
Skip straight to final-round interviews by applying through Triplebyte.