Meta - Menlo Park, CA
posted 2 months ago
The DC Networking team at Meta is responsible for the development, deployment, and operation of the company's global data center networks. This role encompasses the entire network lifecycle, which includes hardware development, capacity planning, and the implementation of both distributed and centralized control systems. The team is engaged in various aspects of network management, including modeling, provisioning, automation, monitoring, troubleshooting, analytics, and simulation/design/failure analysis. We are actively seeking Software Engineers who are passionate about networking and have the aptitude for building scalable distributed systems. This position offers the opportunity to work on one of the most dynamic and fast-paced networks in the world, where you will develop innovative solutions to complex challenges and deploy them into production. As a Software Engineer in Data Center Networking, you will be tasked with designing and implementing drivers and firmware for network ethernet adapter functions, as well as transport stack for RDMA and control functions with the host and accelerators. You will also design and implement platform services that involve programming, monitoring, and controlling various system components such as optics, PHY, FPGAs, sensors, and power management systems. Additionally, you will develop and enhance high-performance computing (HPC) collective communication and parallel computing libraries, including NCCL, RCCL, OneCCL, and MPI. Debugging complex, system-level, multi-component issues that span across multiple layers from kernel to user-mode applications will also be a key responsibility.