Nvidia - Austin, TX
posted 7 days ago
NVIDIA is seeking a highly skilled and experienced Staff Software Engineer to lead the design, deployment, and management of large-scale GPU clusters that power AI workloads across multiple teams and projects. This role is crucial for ensuring the efficiency, scalability, and reliability of GPU clusters, which significantly impact the future of machine learning and artificial intelligence at NVIDIA. The ideal candidate will have a passion for operational excellence and automation, working in a multi-cloud environment, and collaborating with a diverse team to improve infrastructure provisioning and resiliency.