Nvidia - Santa Clara, CA

posted 24 days ago

Full-time - Senior
Santa Clara, CA
5,001-10,000 employees
Computer and Electronic Product Manufacturing

About the position

The Senior Technical Program Manager will lead the strategy and execution of programs to support the bringup, operations, and automation of GPU infrastructure at NVIDIA. This role is crucial for enabling advanced AI and hardware research, ensuring high-quality outcomes and operational excellence in a fast-paced environment. The TPM will collaborate with internal and external partners to scale operations, develop standardized methodologies, and drive continuous improvement across engineering efforts.

Responsibilities

  • Engage with cross-company partners to shape the technical strategy and coordinate execution to meet key business objectives.
  • Nurture a culture of continuous improvement by identifying new opportunities across tooling, automation, and processes.
  • Guide engineering efforts using agile program methodologies across planning, prioritization, design, dependency management, implementation, and execution.
  • Implement a data-first approach to measure program success through metrics, OKRs, and KPIs.
  • Create effective communication channels to provide insights into program status, risks, and opportunities for various audiences.
  • Act as a liaison between developers, customers, and partners to drive organizational alignment.

Requirements

  • B.S. in Computer Science or a related technical discipline or equivalent experience.
  • 12+ years of experience in software engineering and/or technical program management roles.
  • Demonstrated expertise in infrastructure software, production application software development, and large-scale distributed computing.
  • Experience managing large-scale HPC and/or AI infrastructure deployments.
  • Exceptional communication and presentation skills for diverse audiences.
  • Strong multitasking abilities with a focus on thoroughness and rapid context switching.
  • Knowledge of agile methodologies and project management tools.
  • Proactive in identifying and implementing positive changes in software engineering and release management.

Nice-to-haves

  • Prior experience bringing up new datacenter capacity across cloud service providers and on-premise locations.
  • Experience migrating platforms and solutions from on-prem to cloud.
  • Experience working with AI researchers and/or EDA developers.
  • Familiarity with software development, release, and support methodology and DevOps.

Benefits

  • Equity options
  • Comprehensive health benefits
  • Flexible work hours
  • Paid time off
  • Retirement savings plan
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service