Roblox - San Mateo, CA

posted 5 months ago

Full-time - Manager
San Mateo, CA
Professional, Scientific, and Technical Services

About the position

At Roblox, we are on a mission to connect a billion people with optimism and civility through immersive digital experiences. As an Engineering Manager in Software Reliability Engineering, you will play a pivotal role in shaping the future of human interaction by working closely with cross-functional product partners to enhance reliability across our platform. Your responsibilities will include hands-on involvement with production systems, where you will analyze key trends and insights to ensure the health and reliability of our services. You will be tasked with building production guardrails that allow our services to scale reliably, demonstrating a deep commitment to production health and operational excellence. In this role, you will report directly to the Director of Reliability Engineering and will be responsible for leading a strong team of engineers. Your leadership will foster a culture of problem-solving and innovation, enabling your team to tackle unique technical challenges at scale. You will also be expected to engage in hands-on coding, leveraging your technical skills to guide your team in achieving their objectives. Your experience in engineering management, particularly in running Site Reliability Engineering (SRE) teams, will be crucial as you assist engineering teams in designing and implementing scalable services across various infrastructure tiers, including orchestration and service discovery. This position is hybrid, requiring you to be in the San Mateo headquarters three days a week, allowing for collaboration and engagement with your team while also providing flexibility for remote work on other days. You will be part of a dynamic environment where your contributions will directly impact the way millions of users interact with our platform.

Responsibilities

  • Work with cross-functional product partners to enhance reliability.
  • Engage hands-on with production systems to understand key trends and insights.
  • Build production guardrails for services to ensure reliable scaling.
  • Care deeply about the health of production systems.

Requirements

  • Experience building and growing a strong engineering team.
  • Hands-on coding experience with the ability to write code.
  • Knowledge of systems at scale and large-scale distributed infrastructure.
  • 4+ years of experience in engineering management.
  • Customer, team, and quality-oriented skills.
  • Experience assisting engineering teams with scalable design and implementation of services.
  • Experience running Site Reliability Engineering (SRE) teams.
  • Bachelor's degree in Computer Science or equivalent experience.

Nice-to-haves

  • Strong project management skills and strategic planning abilities.

Benefits

  • Industry-leading compensation package
  • Excellent medical, dental, and vision coverage
  • A rewarding 401k program
  • Flexible vacation policy
  • Flexible and supportive work policy (Roflex)
  • Roblox Admin badge for your avatar
  • Free catered lunches five times a week
  • Fully stocked kitchens with unlimited snacks
  • Onsite fitness center and fitness program credit
  • Annual CalTrain Go Pass
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service