Robloxposted 3 months ago
$192,890 - $238,520/Yr
Full-time • Senior
San Mateo, CA
Administrative and Support Services

About the position

As a Site Reliability Engineer (SRE) on the Infra Compute Orchestration (ICO) team, you will create, support, and evolve the infrastructure at Roblox as we build out Roblox's private cloud. ICO's mission is to own and manage our underlying orchestration systems along with elements of service discovery, secrets management and related software layers.

Responsibilities

  • Create systems & libraries that promote fault-tolerance and resilience- like retries, circuit breakers, and adaptive concurrency limits.
  • Build, automate and standardize process automation to create a 'golden path' of tooling and platform support that powers the fundamental Roblox ecosystem.
  • Create tooling that provides production guardrails, for example evaluating release candidate capacity with load testing tooling before deploying to production.
  • Create performance monitoring services and observability towards understanding capacity issues and platform degradations.
  • Create tooling that monitors production services and their changes, like generalized canarying services with alerting.
  • Analyze systems and system designs for production readiness.

Requirements

  • You have a BS degree (or equivalent professional experience) in Computer Science or related engineering field with proven track record including at least 6 years as an SRE or Software Engineer.
  • You have experience and good habits around building software and tools and getting them adopted. Your system's focus advises a view of code needing to be deeply reliable.

Nice-to-haves

  • Experience writing common programming languages (e.g., Go, Java, C#, Rust).
  • Experience in large project lifecycles.
  • Experience working in sprints, breaking down complex tasks into achievements, and reporting status to keep project scheduling accurate.

Benefits

  • Industry-leading compensation package
  • Excellent medical, dental, and vision coverage
  • A rewarding 401k program
  • Flexible vacation policy (varies by exemption status)
  • Roflex - Flexible and supportive work policy
  • Roblox Admin badge for your avatar
  • Free catered lunches five times a week and several fully stocked kitchens with unlimited snacks
  • Onsite fitness center and fitness program credit
  • Annual CalTrain Go Pass
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service