Tesla - Palo Alto, CA
Posted 10 days ago
As a Site Reliability Engineer on Tesla's Supercomputing/AI infrastructure team, you will maintain and enhance the platform that supports the Full Self-Driving (FSD), Tesla Bot, and Dojo engineering teams. Your responsibilities will include managing AI infrastructure, monitoring performance metrics, troubleshooting Linux systems, and ensuring security, with the goal of enabling neural network training at scale and making efficient use of compute resources.