Nvidia - Santa Clara, CA
posted 2 months ago
NVIDIA is a world-leader in high speed computer vision, artificial intelligence, and deep learning. Our team builds and optimizes computer vision AI models, SDKs, and cloud services to bring real-time hardware-accelerated AI to data centers, gaming rigs, cars, robots, buildings, medical devices, and more. We are now looking for an expert in system-level software optimization to own the performance of our core computer vision SDKs and services. The right candidate will bring insight into challenges of delivering performance at scale and passion for multi-disciplinary teamwork and efficient, well-crafted software. In this role, you will profile, debug, and optimize data-center and edge computer vision workloads for efficiency, latency, and throughput. You will implement and improve computer vision and image processing algorithms using CUDA, establish and drive product-critical performance metrics, and influence software architecture and technical roadmaps to ensure outstanding performance. Additionally, you will contribute to large codebases combining custom C++ and Python with distributed architectures (microservices, Kubernetes, Triton) to deliver computer vision at scale. You will also provide technical leadership in high-performance computing to computer vision teams across NVIDIA.