Captions - New York, NY

posted 10 days ago

Full-time - Mid Level
New York, NY

About the position

The Machine Learning Engineer will join the AI Research team at Captions, focusing on building the data infrastructure necessary for training advanced video generation models. This role involves developing offline jobs for large generative models, managing training cluster code, and creating data loaders for extensive video datasets. As an early team member, the engineer will have a significant impact on the product and the company's culture, contributing to foundational machine systems.

Responsibilities

  • Design and develop robust data pipelines to support the efficient handling and processing of video data, ensuring high-quality data input for model training.
  • Build and optimize systems for video frame extraction and other pre-processing steps to prepare data for training workflows.
  • Create and manage data loaders for large-scale video datasets, focusing on speed and efficiency to support various model training requirements.
  • Implement feature engineering techniques that enhance data quality and diversity, aiding in model accuracy and performance.
  • Collaborate with research and engineering teams to scale data infrastructure and enable seamless experimentation and model iterations.
  • Write and maintain cluster code to support high-performance training operations, including resource allocation and management.

Requirements

  • Bachelor's or Master's degree in Computer Science, Data Engineering, Machine Learning, or a related field.
  • 3+ years of professional experience in software engineering, data engineering, or ML infrastructure development.
  • Strong programming skills, particularly in Python, with proven experience with data pre-processing and feature engineering, ideally within video or image data contexts.
  • Professional experience working with large-scale data processing frameworks, deep-learning systems, offline model training workflows, data loaders, and cluster infrastructure.

Benefits

  • Comprehensive medical, dental, and vision plans
  • 401K with employer match
  • Commuter Benefits
  • Catered lunch multiple days per week
  • Dinner stipend every night if you're working late and want a bite!
  • Doordash DashPass subscription
  • Health & Wellness Perks (Talkspace, Kindbody, One Medical subscription, HealthAdvocate, Teladoc)
  • Multiple team offsites per year with team events every month
  • Generous PTO policy and flexible WFH days
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service