Nvidia - Santa Clara, CA

posted about 1 month ago

Full-time - Mid Level
Santa Clara, CA
Computer and Electronic Product Manufacturing

About the position

NVIDIA is seeking a highly skilled Large Language Model (LLM) based Application Infrastructure engineer to design, develop, and maintain infrastructure for internal large language models that facilitate chip design. This role involves collaboration with hardware engineers and LLM research teams to ensure the infrastructure meets the specific needs of GPU design, while optimizing for performance, scalability, and reliability.

Responsibilities

  • Develop and maintain the infrastructure for managing large language models (LLMs) specifically adapted for the chip design and hardware domain.
  • Develop and maintain LLM based applications to serve hardware engineers, such as LLM based QA bot and code generator.
  • Collaborate with HW chip designers and LLM research teams to understand the specific needs and challenges of GPU design.
  • Collect and organize training/fine-tuning data to train hardware specific language models in collaboration with LLM research teams.
  • Optimize the infrastructure for performance, scalability, and reliability, ensuring secure and efficient data management.
  • Stay updated with the latest industry trends in AI and machine learning, looking for opportunities to improve the LLM infrastructure.

Requirements

  • BS in computer science or related field or equivalent experience.
  • 5+ years of experience in developing and maintaining AI or machine learning infrastructure, preferably with large language models.
  • Strong proficiency in Python and web development, familiarity with LLM related techniques such as langchain, vector database, and prompt engineering.
  • Understanding of chip design and related computational and data challenges.
  • Experience with data management, including document cleaning, transformation, and secure storage.
  • Excellent problem-solving skills and ability to work effectively in a team.
  • In-depth understanding of Machine Learning, Deep Learning, and NLP concepts.

Nice-to-haves

  • Experience in crafting and developing production quality microservices.
  • Strong technical background in cloud/distributed infrastructure.
  • Familiarity with front-end development using React or Vue.js.
  • Strong understanding of SQL and NoSQL data platforms.

Benefits

  • Highly competitive salaries
  • Comprehensive benefits package
  • Equity options
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service