Google - Sunnyvale, CA
posted 3 days ago
The Machine Learning III GPU Performance Engineer at Google is responsible for optimizing GPU performance for large language models (LLMs) and ensuring that these models train and serve efficiently at massive scale. The role involves collaborating with product teams to resolve performance issues, running architecture simulations, and analyzing performance metrics to improve system efficiency.