AMD - Boxborough, MA

posted 5 months ago

Full-time - Mid Level
Boxborough, MA
Computer and Electronic Product Manufacturing

About the position

At AMD, we are committed to transforming lives through our technology, and we are looking for an experienced AI Application Engineer to join our HPC Centre of Excellence (HPC CoE). This role is not entry-level and requires significant professional engineering experience along with an advanced technical degree. The successful candidate will work closely with customers and partners to support RFP-driven requests, providing hands-on assistance to ensure their applications run efficiently on AMD hardware. This includes enabling AI workloads on AMD GPUs and CPUs, particularly the Instinct and EPYC series, and understanding the performance characteristics of various training and inference workloads. The role involves broad engineering investigations to assess performance across popular and customer-specific workloads, as well as competitive positioning. You will be responsible for creating comprehensive technical documentation regarding AI performance on AMD hardware, which will support our Field Application team, partners, and customers. This is a hands-on technical position, and we are looking for someone with a solid background in AI, who is adept at executing and tuning workloads. Additionally, the role requires the ability to create and deliver presentations and training sessions, both remotely and in person, necessitating some global travel (approximately 20%). As AMD ramps up its in-house AI expertise, this position offers a unique opportunity for growth and significant impact within the organization, with visibility to senior management. The role is primarily based in North America but is also open to qualified candidates in Europe.

Responsibilities

  • Support winning new AI business.
  • Enable customers to execute their AI workloads on AMD GPUs and CPUs (principally Instinct and EPYC).
  • Support partners in RFP responses by testing requested workloads.
  • Execute popular and customer-driven AI inference and training workloads, generating results and creating a characteristic understanding of AI performance on AMD hardware.
  • Understand how system and software choices affect performance.
  • Compare performance to our competition.
  • Run training and inference performance investigations using common frameworks (Pytorch, Tensorflow, JAX) and repositories (MLperf, Hugging Face).
  • Build a body of documentation for internal and external dissemination: AMD-internal guides, whitepapers, tuning guides, training collateral.
  • Liaise and advise customers and partners through Proof of Concepts, presentations, and training.
  • Assist or lead efforts to port applications to different frameworks or change elements within the software.
  • Create scripts and tools to enable a fast start for customers and developers.
  • Engage actively across AMD teams: GPU Business Unit, Engineering, Architecture, Platform, Software, and Product Development teams, providing feedback and leadership from the field on requirements.
  • Assist in creating TCO models to assist pricing with the bid desk.
  • Technically own and resolve customer and partner issues, submitting JIRA tickets and driving resolution.
  • Automate repeatable procedures.

Requirements

  • Significant professional engineering experience in AI applications.
  • Advanced technical degree in a relevant field.
  • Demonstrable hands-on expertise with popular AI frameworks.
  • Strong positive attitude and willingness to lead by example.
  • Excellent verbal and written communication skills.
  • Ability to prioritize opportunities and deliver results on time.
  • Fluency in English and the right to remain in the USA/Europe.

Nice-to-haves

  • Hands-on AI experience within automotive, finance, enterprise, or defense verticals.
  • Programming experience with HIP, CUDA, Python, C/C++, Fortran, OpenACC, OpenMP, or pSTL.
  • Understanding the impact of inter-node network choices on performance at scale.
  • Experience creating performance projections for applications using Deep Neural Networks.
  • Familiarity with assembly language and memory/cache hierarchy.
  • Government level security clearance.

Benefits

  • Base pay dependent on skills, qualifications, experience, and location.
  • Eligibility for annual bonuses or sales incentives.
  • Opportunity to own shares of AMD stock and discounts on stock purchases through the Employee Stock Purchase Plan.
  • Competitive benefits package.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service