Microsoft - Redmond, WA

posted 16 days ago

Full-time - Senior
Remote - Redmond, WA
Publishing Industries

About the position

The Principal Hardware Quality Engineer will play a crucial role in ensuring the quality and reliability of Microsoft's cloud hardware infrastructure. This position involves leading efforts to improve hardware manufacturing processes, conducting failure analysis, and driving continuous improvement initiatives. The engineer will collaborate with diverse teams to resolve technical issues and establish quality metrics, all while fostering a culture of excellence and innovation within the organization.

Responsibilities

  • Develop and implement a robust supplier quality management strategy for data center hardware.
  • Lead quality issues at the system level and conduct debug and failure analysis for issues including GPU in the Azure fleet.
  • Provide system-level technical guidance to stakeholders and lead through complex problems.
  • Drive continuous improvement processes based on Root Cause Analysis (RCA) and identified opportunities.
  • Responsible for quality readouts based on telemetry data analysis, clarifying status and actions across the organization.
  • Establish Critical-to-Quality performance metrics to measure and improve product quality.
  • Act as the voice of quality in the hardware change management process, ensuring quality requirements are met.
  • Mentor and develop team members, fostering a culture of excellence and innovation.

Requirements

  • Bachelor's Degree in Reliability Engineering, Electrical Engineering, or related field AND 8+ years technical engineering experience OR Master's Degree in Reliability Engineering, Electrical Engineering, or related field AND 7+ years technical engineering experience OR Doctorate Degree in Reliability Engineering, Electrical Engineering, or related field AND 5+ years technical engineering experience.
  • 5+ years of experience working with modern server architectures and/or their subsystems including GPU, CPU, AI hardware, Memory, and Motherboards.
  • 3+ years of experience leading a large-scale taskforce to resolve technical problems.

Nice-to-haves

  • Master's degree in Electrical Engineering, Computer HW, or System Engineering.
  • Leadership skills and ability to collaborate with diverse teams and drive a call to action.
  • 10+ years of experience working with modern server architectures and/or their subsystems.
  • 5+ years of experience leading a large-scale taskforce to resolve technical problems.

Benefits

  • Competitive salary range of USD $137,600 - $267,000 per year, with higher ranges in specific locations.
  • Potential eligibility for additional benefits and compensation.
  • Flexible work arrangements, including up to 100% work from home.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service