Nvidia - Santa Clara, CA

posted 11 days ago

Full-time - Mid Level
Santa Clara, CA
Computer and Electronic Product Manufacturing

About the position

NVIDIA is seeking a DevOps Engineer to support the DGX Cloud engineering team, which focuses on providing a serverless generative AI infrastructure. The role involves developing and maintaining critical tooling for DGX Cloud services, ensuring timely and quality-assured releases, and automating deployment and management of Kubernetes components. The ideal candidate will thrive in a distributed team environment and possess strong problem-solving skills.

Responsibilities

  • Provide both development and operational tooling critical to DGX Cloud services
  • Implement and operate services used by engineering, including first-level on-call/support
  • Assist engineering by maintaining a well optimized & supported paved road SDLC
  • Ensure coverage of testing from unit testing to CI to smoke-testing to full end to end testing
  • Provide developer environments that are easily updated with a low barrier to entry
  • Develop and maintain continuous integration pipeline templates and testing frameworks
  • Provide and operate continuous testing end-to-end integration environments
  • Automate deployment, config, and management of Kubernetes (K8s) components
  • Work across engineering, testing and SRE to ensure tooling alignment

Requirements

  • Bachelor's or Master's degree in Computer Science, Data Science, or a related field (or equivalent experience)
  • 8+ years of experience in developing devops tooling with a profound passion for automation
  • Solid background in modern source control platforms (GitHub/GitLab)
  • Strong experience in modern CI/CD technologies (Gitlab/testing frameworks/ArgoCD)
  • Proficient in container-based infrastructure (Docker, Kubernetes, Helm)
  • Comprehensive experience with Linux distributions (Ubuntu)
  • Solid background in scripting languages (Bash, Python)
  • Working background in higher level languages (golang)
  • Excellent written and verbal communication skills in English

Nice-to-haves

  • Experience in scaling devops practices across cross-functional teams
  • Demonstrated ability to handle sophisticated technical environments while meeting or exceeding all security, reliability, scalability, and availability metrics
  • Strong and confirmed knowledge of modern architectures at scale

Benefits

  • Competitive salaries
  • Generous benefits package
  • Equity options
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service