Tiktok - San Jose, CA

posted 2 days ago

Full-time - Entry Level
San Jose, CA
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

As a Research Scientist in Intelligent Editing at TikTok, you will be at the forefront of cutting-edge research and development in the fields of computer vision and machine learning. This role focuses particularly on multi-modal understanding, which encompasses the integration of vision and language, as well as large-scale training methodologies. You will be responsible for conducting innovative research that pushes the boundaries of what is possible in AI technology, specifically in the context of video and audio content creation. Your work will involve transferring advanced technologies into ByteDance products, ensuring that our offerings remain at the leading edge of the industry. Additionally, you will explore new product opportunities that leverage artificial intelligence at their core, contributing to the evolution of TikTok's platform and enhancing user experiences. The Intelligent Creation Team, which you will be a part of, is dedicated to developing AI, special effects, and audio-video creation technologies. This team is responsible for a wide array of technical fields, including deep learning, computer vision, graphics, and speech processing. By providing cutting-edge content understanding and creation capabilities, the team plays a crucial role in delivering interactive experiences and industry solutions to both internal and external partners. Your contributions will not only impact TikTok's product offerings but also shape the future of content creation in the digital landscape.

Responsibilities

  • Conduct cutting-edge research and development in computer vision and machine learning, especially in the areas of multi-modal understanding, vision and language, and large-scale training.
  • Transfer advanced technologies to ByteDance products.
  • Explore new products with artificial intelligence technology at its core.

Requirements

  • At least 1 year of research and practical experience in one or more areas of computer vision, including multimodal understanding, vision and language, and large-scale training.
  • Experience in multimodal understanding, such as video highlight detection and slicing, and audio/music understanding.
  • Experience in vision and language tasks, such as image/video captioning, retrieval, and visual question answering (VQA).
  • Experience with language models and their application in various downstream tasks, particularly for intelligent editing.
  • Strong coding skills in C/C++ and Python, with a high competency in algorithms and programming.
  • Ability to work collaboratively with team members and independently.

Nice-to-haves

  • Publications in top-tier venues such as CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, ACL, EMNLP, or COLING.

Benefits

  • 100% premium coverage for employee medical insurance, approximately 75% premium coverage for dependents.
  • Health Savings Account (HSA) with a company match.
  • Dental, Vision, Short/Long term Disability, Basic Life, Voluntary Life, and AD&D insurance plans.
  • Flexible Spending Account (FSA) options for healthcare and dependent care.
  • 10 paid holidays per year plus 17 days of Paid Personal Time Off (PPTO) and 10 paid sick days per year.
  • 12 weeks of paid Parental leave and 8 weeks of paid Supplemental Disability.
  • Mental and emotional health benefits through EAP and Lyra.
  • 401K company match, gym and cellphone service reimbursements.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service