Senior Site Reliability Engineer (SRE)

$200,000 - $275,000/Yr

StubHub - Los Angeles, CA

posted 3 days ago

Full-time - Senior

Los Angeles, CA

Administrative and Support Services

About the position

StubHub is seeking a Senior Site Reliability Engineer (SRE) to design and develop next-generation technologies and complex features. This role involves tackling significant challenges and providing innovative technical solutions to ensure the reliability and performance of critical systems. The position is hybrid, requiring three in-person days per week, and is based in New York, NY; Los Angeles, CA; or Aliso Viejo, CA.

Responsibilities

Build out and maintain an observability platform to ensure the reliability, availability, and performance of critical systems.
Collaborate with cross-functional teams to identify and address potential bottlenecks, optimize resource utilization, and proactively prevent system failures.
Drive the implementation of automation tools and Infrastructure as Code (IaC) practices to streamline deployment processes, configuration management, and infrastructure provisioning.
Help develop a center of excellence, fostering a culture that empowers teams to continuously and reliably deliver customer value.
Develop processes, tools and automation to reduce toil across engineering teams.
Ensure systems effectively balance cost, performance, and reliability at scale.

Requirements

Extensive experience (typically 5+ years) in site reliability engineering or a related role, demonstrating a strong command of incident management, mitigation, and prevention, as well as troubleshooting and performance tuning.
Experience developing robust, mission-critical systems using one or more general-purpose programming languages (e.g., C/C++, Java, C#, or any other OOP language).
Experience with cloud computing (AWS, GCP, Azure).
A strong track record of aggressively identifying and removing toil through process optimization, automation, and system design.
Demonstrated ability to write and maintain code for automation, infrastructure orchestration, and reliability tooling.
Demonstrated understanding of large-scale observability platforms and tools.
Understanding of orchestration systems such as Kubernetes.

Benefits

Accelerated Growth Environment
Top Tier Compensation Package
Flexible Time Off
Comprehensive Benefits Package including 401k, Health, Vision, and Dental Insurance options
Team-Building Events
