Meta - Menlo Park, CA

posted about 1 month ago

Full-time - Intern
Menlo Park, CA
Web Search Portals, Libraries, Archives, and Other Information Services

About the position

Meta is seeking Research Scientist Interns to join the AI & Systems Co-Design HPC & Inference team. The role focuses on driving the definition of next-generation AI Systems Inference and Training architectures, working across various hardware types and workload types. Interns will engage in concurrent design and optimization of software and hardware technologies for AI at datacenter scale, contributing to the development of high-performance systems that support Meta's extensive infrastructure.

Responsibilities

  • Develop tools and methodologies for large scale workload analysis and extract representative benchmarks in C++/Python/Hack.
  • Analyze evolving Meta workload trends and business needs to derive requirements for future offerings.
  • Apply in-depth knowledge of how AI/ML systems interact with compute and storage systems.
  • Utilize understanding of CPUs, GPUs, and systems to identify bottlenecks and enhance efficiency.
  • Collaborate with software developers to re-architect services and improve codebase through algorithm redesign.
  • Identify industry trends and analyze emerging technologies.
  • Conduct prototyping exercises to quantify value propositions for Meta.
  • Influence vendor hardware roadmap to align with Meta's requirements.
  • Work with various engineering teams to deliver the hardware roadmap into production.

Requirements

  • Currently has, or is in the process of obtaining, a PhD degree in Computer Science or a related STEM field.
  • Experience with hardware architecture, compute technologies, and/or storage systems.

Nice-to-haves

  • Intent to return to degree-program after the internship.
  • Track record of achieving results demonstrated by grants, fellowships, patents, or publications at leading workshops or conferences.
  • Architectural understanding of CPU, GPU, Accelerators, Networking, and systems.
  • Experience with large-scale infrastructure and distributed systems.
  • Experience in developing and debugging in C/C++, Python, and/or PyTorch.
  • Experience leading a team in solving analytical problems using quantitative approaches.
  • Interpersonal experience in cross-group and cross-culture collaboration.

Benefits

  • $7,500/month to $11,333/month compensation based on skills and experience.
  • Comprehensive benefits package including health insurance, paid holidays, and professional development opportunities.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service