Meta - Menlo Park, CA

posted 4 months ago

Full-time - Senior
Web Search Portals, Libraries, Archives, and Other Information Services

About the position

The Generative AI Org at Meta is seeking a strong technical leader to work on the next generation of large language models, with a particular focus on building new capabilities for Llama inspired by internal use cases. As a technical leader, you will play a critical role in building our series of efficient Llama models and new capabilities on top of them. You will work with internal clients to understand their needs and push the boundaries of text LLMs via breakthroughs in several capabilities. This position requires a deep understanding of both the technical and managerial aspects of AI research and development, as you will lead a team of applied researchers and collaborate with stakeholders across the organization.

In this role, you will drive efficiency gains in the training and deployment of LLMs through novel techniques and oversee the end-to-end development of LLMs, including data sourcing and curation, filtering, experiment design, evaluation, and more. You will also communicate, collaborate, and build relationships with clients and peer teams to facilitate cross-functional projects. You will stay up to date on the team's ongoing research and software development activities, help work through technical challenges, and take part in design decisions. Additionally, you will remain deeply involved in the research community, both understanding trends and setting them, so that Meta remains at the forefront of generative AI advancements.

Responsibilities

  • Drive efficiency gains in the training and deployment of LLMs through novel techniques
  • Drive end-to-end development of LLMs, including data sourcing and curation, filtering, experiment design, evaluation, and more
  • Lead a team of applied researchers to democratize Llama for Meta's users
  • Communicate, collaborate, and build relationships with clients and peer teams to facilitate cross-functional projects
  • Remain up-to-date on ongoing research and software development activities in the team, help work through technical challenges, and be involved in design decisions
  • Remain deeply involved in the research community, both understanding trends and setting them

Requirements

  • 5+ years of hands-on experience with large language models, NLP, and Transformer modeling, in both research and engineering settings
  • Track record of landing large research and/or product impact in a fast-paced environment
  • 3+ years of hands-on experience supporting and leading teams of research scientists and software engineers
  • Proven technical vision for where the field of generative AI is heading
  • Experience with and knowledge of model efficiency techniques (quantization, distillation, etc.)
  • Experience with cross-functional collaboration with product and platform teams, as well as non-engineering functions
  • Demonstrated experience recruiting, building, structuring, and leading technical organizations, including performance management

Nice-to-haves

  • PhD in deep learning, artificial intelligence, and/or a related technical field
  • Experience with and knowledge of ML frameworks like PyTorch, TensorFlow, etc.
  • Experience with and knowledge of large-scale data platforms such as Spark, Hive, etc.
  • Experience with and knowledge of LLM frameworks like LangChain
  • Experience with and knowledge of training LLMs and fine-tuning them on datasets, especially Llama

Benefits

  • Competitive salary ranging from $177,000 to $251,000 per year
  • Bonus opportunities
  • Equity options
  • Comprehensive benefits package
  • Flexible work arrangements
  • Professional development opportunities