Anthropic - San Francisco, CA

posted 4 days ago

Full-time - Mid Level

About the position

As a Machine Learning Infrastructure Engineer on the Core Resources team at Anthropic, you will be responsible for optimizing the infrastructure that supports AI model training. Your role will focus on enhancing the performance, robustness, usability, and efficiency of systems that enable breakthroughs in AI capabilities and safety. You will work collaboratively with researchers to ensure that their needs are met, allowing for rapid progress in AI research while maximizing resource efficiency.

Responsibilities

  • Optimize infrastructure for AI model training and research.
  • Improve performance, robustness, usability, and efficiency of systems.
  • Collaborate with research teams to support their needs and enhance productivity.
  • Identify and scope efficiency opportunities in collaboration with partner teams.
  • Build telemetry and reporting systems for compute occupancy and utilization metrics.

Requirements

  • 8+ years of software engineering experience.
  • Experience with high-performance, large-scale distributed systems.
  • Proficiency in Kubernetes and Python.
  • Familiarity with machine learning and LLM inference.

Nice-to-haves

  • Experience with pair programming.
  • Interest in learning more about machine learning research.
  • Ability to work cross-functionally with finance and other business-facing teams.

Benefits

  • Comprehensive health, dental, and vision insurance for you and all your dependents.
  • 401(k) plan with 4% matching.
  • 22 weeks of paid parental leave.
  • Unlimited PTO - most staff take between 4 and 6 weeks each year.
  • Stipends for education, home office improvements, commuting, and wellness.
  • Fertility benefits via Carrot.
  • Daily lunches and snacks in the office.
  • Relocation support for those moving to the Bay Area.