Amazon - Seattle, WA

posted 4 days ago

Full-time - Mid Level
Seattle, WA
Sporting Goods, Hobby, Musical Instrument, Book, and Miscellaneous Retailers

About the position

The Machine Learning Engineer (L5) role at AWS Neuron focuses on developing and optimizing distributed inference solutions for large-scale machine learning models. This position involves working closely with compiler and runtime engineers to enhance the performance of ML models on AWS Trainium and Inferentia hardware. The role requires strong software development skills and a deep understanding of machine learning frameworks.

Responsibilities

  • Lead efforts to build distributed inference support into Pytorch and Tensorflow using XLA and the Neuron compiler.
  • Tune ML models for optimal performance on AWS Trainium and Inferentia silicon.
  • Design and code solutions to improve software architecture and drive efficiencies.
  • Create metrics and implement automation to enhance processes.
  • Participate in design discussions and code reviews, collaborating with stakeholders.
  • Work cross-functionally to influence business decisions with technical insights.

Requirements

  • 3+ years of professional software development experience.
  • 2+ years of experience in design or architecture of systems.
  • Proficiency in at least one programming language, preferably C++ or Python.

Nice-to-haves

  • 3+ years of experience in the full software development life cycle, including coding standards and testing.
  • Bachelor's degree in computer science or equivalent.

Benefits

  • Competitive salary based on geographic location and experience.
  • Equity and sign-on payments may be included in the compensation package.
  • Comprehensive medical, financial, and other benefits.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service