This job is closed

Nvidia - Santa Clara, CA

posted 3 months ago

Full-time - Senior
Santa Clara, CA
Computer and Electronic Product Manufacturing

About the position

The Distinguished Engineer - Data Center System Software Architect at NVIDIA is responsible for the end-to-end architecture of data center systems, including firmware, kernel drivers, operating systems, and user mode drivers. This role involves collaborating with internal teams and industry-leading cloud service providers to align product roadmaps and drive the adoption of new technologies and protocols. The architect will also mentor engineering teams and make critical technical decisions to mitigate execution risks.

Responsibilities

  • Drive the system architecture for a complex server platform in a cross-functional environment.
  • Work directly with major customers to understand their requirements and align their roadmap with NVIDIA's roadmap.
  • Collaborate with business partners and vendors to shape their products to meet NVIDIA's needs.
  • Develop a roadmap of new technologies and protocols and drive their design and adoption.
  • Mentor architects and engineering teams to grow them into future leaders.
  • Make key technical decisions even when faced with ambiguity, and mitigate execution risks by following a shift-left strategy.

Requirements

  • Deep experience in designing architecture for scalable and performant server systems, particularly at the SW/HW interface.
  • Previous experience working with complex system software for accelerators such as GPUs, DPUs, or FPGAs.
  • Expertise in out-of-band and in-band management architectures.
  • Knowledge of device management protocols such as MCTP, PLDM and RDE.
  • Knowledge of system management protocols such as Redfish and IPMI.
  • Experience working with platform security experts to define tradeoffs between security and ease of use.
  • Demonstrable experience in implementing a shift-left strategy to de-risk program execution.
  • Excellent written and verbal communication skills.
  • BS or MS degree in Computer Engineering, Computer Science, or a related field, or equivalent experience.
  • 20+ years of experience in system architecture and design.

Nice-to-haves

  • Knowledge of cloud and cluster level deployment and management systems.
  • Participation in and contributions to standards bodies such as OCP and DMTF.
  • Familiarity with CXL architectures.
  • Knowledge of storage and networking technologies.

Benefits

  • Equity options
  • Comprehensive health insurance
  • Retirement savings plan
  • Paid time off
  • Flexible work hours
  • Professional development opportunities