AMD - Austin, TX

posted about 2 months ago

Full-time - Senior
Austin, TX
Computer and Electronic Product Manufacturing

About the position

The Fellow - Data Center GPU Systems Design Validation Architect role at AMD involves leading the validation efforts for advanced GPU systems, particularly in AI and HPC domains. This position focuses on ensuring robust system-level integration and validation, with a hands-on approach to testing and problem-solving. The architect will orchestrate comprehensive validation strategies, working closely with various teams to identify and mitigate potential product weaknesses, ultimately contributing to the development of cutting-edge data center solutions.

Responsibilities

  • Orchestrating the development and implementation of advanced validation strategies to identify potential product weaknesses.
  • Leading a team in pioneering technical validation initiatives focusing on PCIe, HBM, and SMC/BMC firmware.
  • Creating and executing validation test plans that address functional and stress scenarios, including emulation of end-customer systems.
  • Ensuring compliance with OCP standards and secure solution development, including Out of Band Management and Redfish features.
  • Collaborating with multiple teams to devise and execute exhaustive validation test plans simulating real-world stress scenarios and customer workloads.
  • Championing the process of debugging, root cause analysis, and resolution of issues discovered during validation phases.
  • Working closely with development teams to ensure all identified issues are addressed before production.
  • Advancing end-to-end validation test content utilizing creative debugging skills.

Requirements

  • Proficiency in programming/scripting languages (e.g., C/C++, Perl, Ruby, Python).
  • Expertise in state-of-the-art debugging techniques and methodologies.
  • Extensive experience with lab equipment such as protocol/logic analyzers and oscilloscopes.
  • Deep knowledge in board/platform-level debug, including delivery, sequencing, analysis, and optimization.
  • Comprehensive understanding of system architecture, focusing on technical debug and validation strategy development.
  • Exceptional analytical and problem-solving skills with meticulous attention to detail.
  • Self-driven with the ability to lead tasks independently to successful completion.

Benefits

  • Base pay depending on skills, qualifications, experience, and location.
  • Eligibility for annual bonuses or sales incentives.
  • Opportunity to own shares of AMD stock.
  • Discount when purchasing AMD stock through the Employee Stock Purchase Plan.
  • Competitive benefits package.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service