Microsoft - Mountain View, CA

posted about 1 month ago

Full-time - Principal
Mountain View, CA
Publishing Industries

About the position

The Principal Systems Engineer will be a key member of the Azure Cloud Hardware and Infrastructure Engineering (CHIE) team, responsible for developing and delivering hardware designs for Microsoft's Azure Cloud. This role involves collaboration with cross-functional teams to create innovative end-to-end hardware solutions that support Azure AI/ML infrastructure, ensuring compatibility with Microsoft Azure datacenter software and customer expectations.

Responsibilities

  • Collaborate with architecture, silicon engineering, firmware, hardware design, hardware validation, OS, manufacturing, and customer teams to build state-of-the-art accelerator hardware solutions.
  • Analyze new interfaces and subsystems to develop integration plans, analyze power efficiency, debug integration issues, and provide recommendations.
  • Define system behavior and concept of operations for the platform to ensure compatibility with Microsoft Azure datacenter software, serviceability, telemetry, and customer expectations.
  • Perform NUDD (new, unique, different and difficult) technology and feature analysis and provide risk assessment and mitigations.
  • Drive technical requirements and ensure the solution is flexible and scalable across the full (HW/FW/SW) stack.
  • Enable platform and solution level discussions, influencing architecture of the product, and delivering to product goals across quality, reliability, and performance.
  • Collaborate with internal, external, and open-source partners to onboard innovative technologies in a seamless manner.

Requirements

  • Proven experience in hardware design and validation.
  • Strong understanding of system architecture and integration processes.
  • Experience with power efficiency analysis and debugging integration issues.
  • Ability to define system behavior and operational concepts for complex platforms.
  • Experience in risk assessment and mitigation strategies for new technologies.
  • Strong collaboration skills to work with cross-functional teams and partners.

Nice-to-haves

  • Experience with AI/ML infrastructure and technologies.
  • Familiarity with Microsoft Azure services and datacenter operations.
  • Knowledge of open-source technologies and their integration into hardware solutions.

Benefits

  • Health insurance coverage
  • 401k retirement savings plan
  • Paid holidays and vacation time
  • Professional development opportunities
  • Flexible work hours
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service