Microsoft - Raleigh, NC

posted 2 months ago

Full-time - Senior
Raleigh, NC
Publishing Industries

About the position

Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN, Office 365, Xbox Live, Teams, OneDrive, and the Microsoft Azure platform globally with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide and we are looking for passionate engineers to help achieve that mission. As Microsoft's cloud business continues to grow the ability to deploy new offerings and hardware infrastructure on time, in high volume with high quality and lowest cost is of paramount importance. To achieve this goal, the Hardware, Infrastructure Management, and Fundamentals Engineering (HIFE) team is instrumental in defining and delivering operational measures of success for hardware manufacturing, improving the planning process, quality, delivery, scale and sustainability related to Microsoft cloud hardware. We are looking for seasoned engineers with a dedicated passion for customer focused solutions, insight and industry knowledge to envision and implement future technical solutions that will manage and optimize the Cloud infrastructure. We are looking for a Principal Hardware Quality Engineer to join the team. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

  • Hands on debug in data center (onsite and virtual)
  • Develop and implement a robust supplier quality management strategy to ensure the data center hardware is manufactured at the highest level of quality standards.
  • Leadership to work across data centers, development, and supplier to resolve critical & high severity issues.
  • Conduct hands on debug in global data centers (onsite and virtual) including GPU sub-system failure analysis.
  • Drive the continuous improvement process based on Root Cause Analysis (RCA) and identified opportunities.
  • Manage multiple NPI builds and quality phase-gate deliverables for the manufacturing team throughout the engineering development lifecycle, from concept through production readiness.
  • Establish Critical-to-Quality performance metrics to measure and improve product quality.
  • Act as the voice of quality in the hardware change management process, ensuring quality requirements are considered and met.
  • Embody our culture and values.

Requirements

  • Bachelor's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 8+ years technical engineering experience.
  • OR Master's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 7+ years technical engineering experience.
  • OR Doctorate Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 5+ years technical engineering experience.
  • 8+ years of experience in working with modern server architectures including GPU, Artificial Intelligence hardware, Memory or CPU and methods for failure analysis, debugging or validation.
  • 8+ years of direct engineering experience in hardware system issue resolution in Data Centers.
  • 3+ years of experience in leading a large-scale taskforce to resolve technical problems and solutions.
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.

Nice-to-haves

  • Master's degree in Electrical Engineering, Software Engineering, Or System Engineering.
  • 12+ years of experience in working with the modern server architectures - includes understanding of GPU, Artificial Intelligence hardware, Memory or CPU and methods for failure analysis, debugging or validation.
  • 12+ years of proven success of leading resolution of critical quality issues across data centers.

Benefits

  • Health insurance
  • Dental insurance
  • 401k
  • Paid holidays
  • Flexible scheduling
  • Professional development
  • Tuition reimbursement
  • Employee stock purchase plan
  • Performance bonus
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service