American IT Systems - Redmond, WA

posted 4 days ago

Full-time - Mid Level
Redmond, WA

About the position

The AI Engineer - Machine Learning 2 role is focused on advancing the development and improvement of software foundations and tools essential for training state-of-the-art AI models, particularly in robotics. The position involves creating scalable and efficient training infrastructures, collaborating with researchers and software engineers, and ensuring the smooth integration and functionality of training systems. The role emphasizes the importance of experience in robotics or reinforcement learning, aiming to expand the capabilities of AI in practical applications.

Responsibilities

  • Create and uphold efficient, scalable, and distributed training systems including data preprocessing, training orchestration, and model assessment for training large-scale AI models.
  • Enhance the efficiency of training procedures to improve performance and use of resources, while maintaining scalability and dependability.
  • Collaborate with researchers to create training and evaluation pipelines for state-of-the-art algorithms.
  • Develop and design benchmarks for evaluating ML models.
  • Perform training and fine-tuning of foundation models for robotic applications.
  • Monitor and analyze pipelines, identifying bottlenecks and proposing solutions to improve efficiency and performance.
  • Ensure the robustness and reliability of the training infrastructure, including automated testing and continuous integration.

Requirements

  • BS, MS or higher degree in Computer Science, Robotics, Engineering or a related field, or equivalent practical experience.
  • Proficiency in Python, C++, or similar and at least one deep learning library such as PyTorch, TensorFlow, JAX, etc.
  • Strong background in distributed computing, parallel processing techniques, handling large-scale datasets and data preprocessing.
  • Deep understanding of state-of-the-art machine learning techniques and models.
  • Experience with cloud-based training environments (AWS, Google Cloud, Azure).
  • Experience in developing and maintaining software tooling and infrastructure for machine learning.
  • Deep understanding and practical experience with software engineering principles, including algorithms, data structures, and system design.
  • Experience with continuous integration and automated testing frameworks.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service