Principal Systems Engineer

$137,600 - $267,000/Yr

Microsoft - Mountain View, CA

posted about 1 month ago

Full-time - Principal
Mountain View, CA
Publishing Industries

About the position

The Principal Systems Engineer will be a key member of the Azure Cloud Hardware and Infrastructure Engineering (CHIE) team, responsible for designing and deploying hardware systems for Microsoft's Azure Cloud. This role involves collaboration with cross-functional teams to develop innovative hardware solutions that support AI/ML infrastructure, ensuring compatibility with Microsoft Azure datacenter software and meeting customer expectations. The position offers an opportunity to leverage extensive hardware design and validation experience in a cutting-edge public cloud environment.

Responsibilities

  • Collaborate with architecture, silicon engineering, firmware, hardware design, hardware validation, OS (operating systems), manufacturing, and customer teams to build state-of-the-art accelerator hardware solutions.
  • Analyze new interfaces and subsystems to develop integration plans, analyze power efficiency, debug integration issues, and provide recommendations.
  • Define system behavior and concept of operations for the platform to ensure compatibility with Microsoft Azure datacenter software, serviceability, telemetry, and customer expectations.
  • Perform NUDD (new, unique, different and difficult) technology and feature analysis and provide risk assessment and mitigations.
  • Drive technical requirements and ensure the solution is flexible and scalable across the full (HW/FW/SW) stack.
  • Enable platform and solution level discussions, influencing architecture of the product, and delivering to product goals across quality, reliability, and performance.
  • Collaborate with internal, external, and open-source partners to onboard innovative technologies in a seamless manner.

Requirements

  • 10+ years of technical engineering experience OR Bachelor's degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 8+ years of technical engineering experience OR Master's degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 5+ years of technical engineering experience OR Doctorate degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 4+ years of technical engineering experience.
  • 8+ years of hands-on experience in developing accelerator based HW systems for scale up and scale out data center use cases.
  • Experience with hardware, firmware, management firmware, and/or software (system & application stack) interfaces across all modules in a system.
  • Ability to work across multiple disciplines (hardware, firmware, software, and/or data center infrastructure) to identify risks, drive discussions, detail system tradeoffs, and assess impact.
  • Ability to meet Microsoft, customer and/or government security screening requirements.

Nice-to-haves

  • Hands on experience developing GPU, FPGA based accelerator platforms for AI/ML used cases.
  • Knowledge of high-volume silicon (SoCs, GPUs, or FPGAs), compute, storage, and/or networking design, manufacturing, and deployment.
  • Experience with highspeed interfaces such as PCIe, DDR, and ethernet.
  • In depth experience with operating systems (Windows and/or Linux), system firmware (BIOS, BMC), and system security (hardware and software).
  • Knowledge in virtualization technologies.
  • Knowledge about datacenters & operations at scale.
  • Experience managing hardware programs through the entire product lifecycle.
  • Strong verbal and written communication and presentation skills.

Benefits

  • Health insurance
  • 401k
  • Paid holidays
  • Flexible scheduling
  • Professional development opportunities
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service