Meta - Bellevue, WA
posted 3 months ago
Meta is seeking a Research Scientist to join our Research & Development teams, focusing on AI Infrastructure. The ideal candidate will have industry experience working on AI Infrastructure related topics. This position involves applying these skills to solve some of the most crucial and exciting problems that exist on the web. We are hiring in multiple locations, and the Kernel team is dedicated to maximizing the inference performance for Generative AI and Recommendation models by developing high-performance kernels. Our expertise lies in creating specialized kernels that significantly improve the efficiency of these models. We have successfully developed and deployed the first FP8 kernel in Meta's production, as well as FBGEMM TBE. By continuously advancing our kernel optimization capabilities, we enable better user experiences and drive innovation in the field of Generative AI and Recommendation systems. The E2E Performance team is dedicated to optimizing the end-to-end performance of Generative AI and Recommendation models. We employ various parallelism strategies and distributed inference techniques to enhance TTIT and TTFT for LLM and LDM. By relentlessly pursuing performance improvements, we have achieved notable successes such as enabling the utilization of AMD GPUs for GenAI production applications and subsequently optimizing their performance. Our ongoing efforts ensure the continuous betterment of these models' performance, ultimately providing more responsive and seamless experiences for users interacting with Generative AI.