Meta - Menlo Park, CA

posted about 1 month ago

Full-time - Intern
Web Search Portals, Libraries, Archives, and Other Information Services

About the position

The Research Scientist Intern position within the AI System SW/HW Co-design team at Meta focuses on exploring and developing high-performance software and hardware technologies for AI at datacenter scale. The role involves optimizing machine learning workloads and collaborating with various engineering teams to enhance performance and efficiency in large-scale Generative AI and ranking/recommendation training jobs. Interns will utilize cutting-edge optimization strategies to maximize training throughput and influence industry partners to align with Meta's infrastructure goals.

Responsibilities

  • Develop tools and methodologies in C++/Python/Hack for large-scale workload analysis and for extracting representative benchmarks.
  • Analyze evolving Meta workload trends and business needs to derive requirements for future offerings.
  • Utilize extensive understanding of CPUs, Flash/HDD storage systems, networking, and GPUs to identify bottlenecks and enhance product/service efficiency.
  • Collaborate closely with software developers to re-architect services, improve codebase through algorithm redesign, reduce resource consumption, and identify hardware/software co-design opportunities.
  • Identify industry trends, analyze emerging technologies and disruptive paradigms, and conduct prototyping exercises to quantify the value proposition for Meta.
  • Work with Software Services, Product Engineering, and Infrastructure Engineering teams to find the optimal way to deliver the hardware roadmap into production.

Requirements

  • Currently pursuing a PhD degree in Computer Science or a related STEM field.
  • Experience with hardware architecture, compute technologies, and/or storage systems.
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment.
  • Intent to return to degree program after the completion of the internship.

Nice-to-haves

  • Track record of achieving results demonstrated by grants, fellowships, patents, or first-authored publications at leading workshops or conferences.
  • Architectural understanding of CPUs, GPUs, accelerators, networking, and Flash/HDD storage systems.
  • Experience with distributed AI training and inference focusing on performance, programmability, and efficiency.
  • Some experience with large-scale infrastructure, distributed systems, and full stack analysis of server applications.
  • Experience or knowledge in developing and debugging in C/C++, Python, and/or PyTorch.
  • Experience driving original scholarship in collaboration with a team.
  • Interpersonal experience in cross-group and cross-cultural collaboration.
  • Experience in theoretical and empirical research, and in using research to answer questions.
  • Experience communicating research to public audiences of peers.

Benefits

  • Competitive monthly compensation ranging from $7,800 to $11,293.
  • Opportunity to work on cutting-edge technologies in AI and systems co-design.
  • Internship duration of 12 to 16 weeks with various start dates throughout the year.