Software Engineer, SystemML - Scaling / Performance

Meta
Menlo Park, CA
459d
$146,994 - $208,000

This job is no longer available

About The Position

The Software Engineer, SystemML - Scaling / Performance role at Meta sits within the Network.AI Software team and works on the software stack around the NVIDIA Collective Communications Library (NCCL). The position focuses on enabling reliable and scalable distributed machine learning (ML) training, particularly for Generative AI (GenAI) and large language models (LLMs). The team is responsible for improving the performance and reliability of distributed ML workloads across Meta's GPU infrastructure, which supports new ML products and applications.
