Meta - Menlo Park, CA
posted 5 months ago
AI Training and Inference is a core pillar of Meta's success. To achieve Meta's AI goals, the network infrastructure, from the networking software stack through to the network switches, must operate with a high level of reliability. Production Engineers play a key role in driving the reliability of this network by deep diving into production issues through the entire stack and building software systems to ensure that operations can be scaled appropriately. To support delivering on these goals, Production Engineering Managers play a critical role in supporting and growing the organization to ensure the success of shared goals across the domain. As a Manager of Production Engineering (Network), you will support and lead engineers who are responsible for reliably scaling Meta's AI/HPC networking operations. You will partner with teams across Meta's AI/HPC environment to ensure alignment on operational priorities and approaches across the domain. Your role will involve understanding and contributing to technical architectures, capacity plans, tooling needs, automation plans, product launch plans, and creating comprehensive plans for prioritizing technical and resourcing challenges. You will drive technical architecture discussions, even on subjects you haven't had direct experience working with, and help define and drive a technical roadmap to meet organizational objectives. In addition, you will help engineers develop their careers by assigning them to projects tailored to their skill levels, long-term skill development, personalities, and work styles. You will also play a vital role in building and enriching an inclusive work environment comprised of people from diverse backgrounds. Regular assessment of employee performance, addressing under-performance, and recognizing and promoting performance will be part of your responsibilities. Balancing the need to keep operations running with allocating time to long-term, high-impact projects will be crucial in this role.