Bytedance - San Jose, CA
posted about 2 months ago
ByteDance, founded in 2012, is on a mission to inspire creativity and enrich life through its diverse suite of products, including TikTok and various platforms tailored for the Chinese market. The company is dedicated to fostering innovation and creativity, encouraging teams to tackle challenges with courage and a collaborative spirit. The Seed Team, established in 2023, focuses on developing advanced AI foundation models, conducting research in natural language processing, computer vision, and speech recognition. This team is committed to leading global research and driving technological and social progress, leveraging substantial data and computing resources to build proprietary models that support numerous business services. The LLM Global Data team plays a crucial role in producing international data for large language models (LLMs). Data is essential for the quality of these models, and the team collaborates closely with technical, product, and operations teams to ensure effective data production strategies. The responsibilities of this position include defining data requirements for LLMs, developing methodologies for data acquisition, monitoring costs, and ensuring high data quality. The role also involves evaluating the impact of data production tools and proactively identifying and mitigating biases in the data production process. This position is ideal for individuals who are passionate about AI and data quality, and who thrive in a collaborative environment.