StubHub - Los Angeles, CA

posted 3 days ago

Full-time - Senior
Los Angeles, CA
Administrative and Support Services

About the position

StubHub is seeking a Senior Site Reliability Engineer (SRE) to design and develop next-generation technologies and complex features. This role involves tackling significant challenges and providing innovative technical solutions to ensure the reliability and performance of critical systems. The position is hybrid, requiring three in-person days per week, and is based in New York, NY; Los Angeles, CA; or Aliso Viejo, CA.

Responsibilities

  • Build out and maintain an observability platform to ensure the reliability, availability, and performance of critical systems.
  • Collaborate with cross-functional teams to identify and address potential bottlenecks, optimize resource utilization, and proactively prevent system failures.
  • Drive the implementation of automation tools and Infrastructure as Code (IaC) practices to streamline deployment processes, configuration management, and infrastructure provisioning.
  • Help develop a center of excellence, fostering a culture of empowering teams to continuously and reliably deliver customer value.
  • Develop processes, tools, and automation to reduce toil across engineering teams.
  • Ensure systems effectively balance cost, performance, and reliability at scale.

Requirements

  • Extensive experience (typically 5+ years) in site reliability engineering or a related role, demonstrating a strong command of incident management, mitigation, and prevention, as well as troubleshooting and performance tuning.
  • Experience developing robust, mission-critical systems using one or more general-purpose programming languages (e.g., C/C++, Java, C#, or another object-oriented language).
  • Experience with cloud computing (AWS, GCP, Azure).
  • A strong track record of aggressively identifying and removing toil through process optimization, automation, and system design.
  • Demonstrated ability to write and maintain code for automation, infrastructure orchestration, and reliability tooling.
  • Demonstrated understanding of large-scale observability platforms and tools.
  • Understanding of orchestration systems such as Kubernetes.

Benefits

  • Accelerated Growth Environment
  • Top Tier Compensation Package
  • Flexible Time Off
  • Comprehensive Benefits Package including 401k, Health, Vision, and Dental Insurance options
  • Team-Building Events