Tiktok - Seattle, WA

posted 4 days ago

Full-time - Entry Level
Seattle, WA
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

As a Backend Software Engineer on the Recommendation - Data Infrastructure Team at TikTok, you will play a crucial role in designing and implementing a robust storage solution for offline data within TikTok's recommendation system, which serves over a billion users globally. Your work will focus on ensuring system reliability, uninterrupted service, and seamless performance, which are vital for the success of our recommendation, search, and advertising functions. You will be part of a team that is dedicated to creating a flexible, scalable, and high-performance storage system that can adapt to various data sources and accommodate diverse storage needs. In this position, you will be responsible for designing and implementing an offline and real-time data architecture that supports large-scale recommendation systems. You will also troubleshoot production systems and develop necessary mechanisms and tools to maintain overall system stability. Your contributions will include building industry-leading distributed systems, such as offline and online storage solutions, as well as batch and stream processing frameworks, which will provide reliable infrastructure for handling massive data and supporting large-scale business operations. The ideal candidate will have a strong background in computer science and experience in building scalable systems. You will be expected to demonstrate proficiency in big data processing systems like Spark or Flink, and have a deep understanding of data lake technologies such as Hudi, Iceberg, or DeltaLake. Your role will require strong coding skills in programming languages such as Java, C++, and Scala, along with the ability to troubleshoot effectively. You will also need to be enthusiastic about tackling challenging problems and eager to learn new technologies, especially in the context of managing large-scale data.

Responsibilities

  • Design and implement an offline/real-time data architecture for large-scale recommendation systems.
  • Design and implement a flexible, scalable, stable, and high-performance storage system and computation model.
  • Troubleshoot production systems and design necessary mechanisms and tools to ensure overall stability.
  • Build industry-leading distributed systems such as offline and online storage, batch, and stream processing frameworks.

Requirements

  • Bachelor's Degree or above in Computer Science or related fields with 1+ years of experience building scalable systems.
  • Proficiency in common big data processing systems like Spark/Flink at the source code level, with experience in customizing or extending these systems.
  • Deep understanding of the source code of at least one data lake technology, such as Hudi, Iceberg, or DeltaLake, with practical implementation or customization experience.
  • Knowledge of HDFS principles and familiarity with columnar storage formats like Parquet/ORC.
  • Prior experience in data warehousing modeling.
  • Proficiency in programming languages such as Java, C++, and Scala, with strong coding skills and troubleshooting ability.
  • Experience with other big data systems/frameworks like Hive, HBase, or Kudu is a plus.
  • Willingness to tackle challenging problems without clear solutions and enthusiasm for learning new technologies.

Nice-to-haves

  • Experience managing large-scale data in the petabyte range.
  • Familiarity with additional big data processing frameworks.

Benefits

  • 100% premium coverage for employee medical insurance, approximately 75% premium coverage for dependents.
  • Health Savings Account (HSA) with a company match.
  • Dental, Vision, Short/Long term Disability, Basic Life, Voluntary Life and AD&D insurance plans.
  • Flexible Spending Account (FSA) options for healthcare and dependent care.
  • 10 paid holidays per year plus 17 days of Paid Personal Time Off (PPTO) and 10 paid sick days per year.
  • 12 weeks of paid Parental leave and 8 weeks of paid Supplemental Disability.
  • Mental and emotional health benefits through EAP and Lyra.
  • 401K company match, gym and cellphone service reimbursements.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service