Roblox • posted about 2 months ago
$238,520 - $289,460/Yr
Full-time • Mid Level
San Mateo, CA

About the position

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and we’re looking for amazing talent to help us get there. A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.

Are you a seasoned engineer with a passion for ML reliability? We’re looking for exceptional Software Engineers to join the Reliability team at Roblox. In this pivotal role, you will drive the evolution of our ML systems, ensuring they meet the highest standards of performance, reliability, and efficiency. You’ll collaborate with cross-functional teams to build robust ML infrastructure that supports our growth. If you have a track record of solving complex technical challenges, we want to hear from you.

Join us in shaping the future of our platform and delivering unparalleled value to our users. At Roblox, our vision is to achieve 1 billion daily active users. We believe this engineer will be instrumental in driving us towards that ambitious goal.

Responsibilities

  • Build and standardize process automation to create a 'golden path' of ML tooling and platform support that powers the Roblox ML ecosystem.
  • Create tooling that provides production guardrails for developing and delivering ML training and inference services to production.
  • Create performance monitoring and observability services to help understand ML capacity issues and platform degradations.

Requirements

  • You have a BS degree (or equivalent professional experience) in Computer Science or a related engineering field, with at least 6 years of experience, including at least 2 years in SRE or Software Engineering.
  • Deep experience running large-scale Kubernetes clusters in production, both on-premises and hosted.
  • Hands-on experience with observability, maintenance, and upgrades of large-scale Kubernetes clusters.
  • Experience running ML training and inference workloads on Kubernetes, supporting MLOps frameworks like Kubeflow and working with GPUs.
  • Experience working with popular machine learning frameworks such as TensorFlow or PyTorch.

Nice-to-haves

  • Experience and good habits around building software and tools and getting them adopted.
  • Experience in large project lifecycles.
  • Experience working in sprints, breaking down complex tasks into milestones, and reporting status to keep project scheduling accurate.

Benefits

  • Industry-leading compensation package
  • Excellent medical, dental, and vision coverage
  • A rewarding 401k program
  • Flexible vacation policy (varies by exemption status)
  • Roflex - Flexible and supportive work policy
  • Roblox Admin badge for your avatar
  • Free catered lunches five times a week and several fully stocked kitchens with unlimited snacks
  • Onsite fitness center and fitness program credit
  • Annual Caltrain Go Pass

Job Keywords

Hard Skills
  • Go
  • Kubeflow
  • Kubernetes
  • Python
  • PyTorch