AMD - San Jose, CA

posted about 1 month ago

Full-time - Mid Level
San Jose, CA
Computer and Electronic Product Manufacturing

About the position

The Sr. Machine Learning Performance Engineer at AMD is responsible for ML performance modeling, projection, and optimization for various ML workloads. This role involves analyzing the interaction between ML workloads and hardware architecture, particularly focusing on generative AI models across multiple hardware configurations. The engineer will collaborate with customers and business units to project performance, analyze results, and develop solutions to meet customer needs, ultimately shaping the future of AI acceleration.

Responsibilities

  • Performance modeling and analysis of ML training and inference workloads across single and multiple accelerators.
  • Explore various tradeoff and design decisions.
  • Participate in hardware-software co-design for future hardware optimization on various ML workloads.
  • Communicate and present the results of the performance analysis and modeling to stakeholders and provide concrete recommendations.
  • Develop and improve our framework, tools and infrastructure for performance estimation, modeling and reporting.
  • Cross team collaboration.

Requirements

  • Strong technical expertise and experience in performance analysis, projection, and hardware architecture.
  • Excellent written, verbal, and presentation skills.
  • Experienced in C++ coding.
  • PhD or master's degree in computer science, electrical engineering, or a related field.

Benefits

  • Base pay dependent on skills, qualifications, experience, and location.
  • Eligibility for annual bonus or sales incentive.
  • Opportunity to own shares of AMD stock.
  • Discount on AMD stock through Employee Stock Purchase Plan.
  • Competitive benefits package.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service