Outscal Technologies - New York, NY

posted 8 days ago

Full-time - Senior
New York, NY

About the position

The Research Scientist position in GenAI at Outscal Technologies focuses on developing multimodal generative foundation models, particularly in the audio domain, which includes speech, sound, and music. The role involves conducting full life-cycle research, designing and implementing models, and collaborating with cross-functional teams to achieve high-level goals. The successful candidate will contribute to cutting-edge research that impacts billions of users globally.

Responsibilities

  • Conduct full life-cycle research on multimodal generative foundation models with a focus on audio modalities.
  • Design and implement models and algorithms for audio generation.
  • Collect and select training data, train, tune, and scale models, and evaluate their performance.
  • Collaborate with language and vision research teams to leverage expertise and achieve project goals.

Requirements

  • Bachelor's degree in Computer Science, Computer Engineering, or a relevant technical field.
  • PhD in a related field with 3+ years of experience, or a BS degree with 5+ years of industrial research experience.
  • Solid track record of research in audio or vision domains.
  • Proven knowledge in neural networks and experience with ML frameworks like PyTorch, TensorFlow, or JAX.
  • Proficiency in Python programming language.
  • Strong communication skills.

Nice-to-haves

  • Solid publication track record in related fields.
  • Experience in audio dataset curation, model scaling, or audio generation model evaluation.
  • Experience in large-scale data processing.
  • Ability to solve complex problems involving trade-offs and cross-functional collaboration.

Benefits

  • Bonus
  • Equity
  • Comprehensive benefits package
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service