NVIDIA - Santa Clara, CA

posted 23 days ago

Full-time - Senior
Santa Clara, CA
Computer and Electronic Product Manufacturing

About the position

The Senior Software Architect role at NVIDIA focuses on advancing AI and high-performance computing (HPC) through innovative software architecture. The position involves researching, developing, and deploying solutions in networking hardware, programming environments, and system software to enhance the performance and scalability of high-end computer systems. The role is set within a diverse and supportive environment that encourages creativity and collaboration among talented individuals.

Responsibilities

  • Creating proofs of concept to evaluate and motivate extensions in AI frameworks (PyTorch/NeMo) and HPC programming models (MPI, OpenSHMEM, PGAS).
  • Researching, designing, and implementing features for AI and HPC communication middleware (NCCL, Open MPI, UCX, UCC, NVSHMEM) and Deep Learning frameworks such as TensorFlow/PyTorch.
  • Reviewing, designing, and implementing compiler features that support the NVIDIA networking ecosystem.
  • Researching, designing, and developing hardware features relevant to scientific, Deep Learning, and data-intensive workloads.

Requirements

  • Ph.D., Master's, or Bachelor's degree in computer science, computer engineering, or a closely related field, or equivalent experience.
  • 8+ years of experience in parallel programming models and/or network architecture.
  • Background in algorithm design, system programming, and computer architecture.
  • Strong programming and software development skills.
  • Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.

Nice-to-haves

  • Deep understanding of technology and passion for what you do.
  • Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic matrix environment.
  • Background in designing communication middleware for high-performance computing systems, including InfiniBand, DPUs, Ethernet, and shared memory.
  • Experience developing compiler features and optimizations, particularly for Clang/LLVM and NVIDIA compilers, and experience implementing communication libraries, particularly MPI, OpenSHMEM, NCCL, NVSHMEM, UCX, UCC, or PGAS.
  • Background in CUDA programming, NVIDIA GPUs, and programming models for emerging architectures, including hierarchical heterogeneous memory systems and accelerators.

Benefits

  • Competitive salaries
  • Generous benefits package
  • Equity options