TikTok - Seattle, WA

posted 3 months ago

Full-time - Mid Level
Seattle, WA
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

TikTok is the leading destination for short-form mobile video, with a mission to inspire creativity and bring joy to over 1 billion users globally. The Data Platform Team at TikTok is focused on addressing challenges in data infrastructure and data products. This team is responsible for various critical components, including the Query Engine, Logging and Data Ingestion Infrastructure, Experimentation Platform, and Workflow Management Platform. The primary goal of the team is to support ad-hoc and interactive queries, manage batch pipelines, log and ingest large volumes of real-time data, and facilitate A/B testing for all product feature launches.

As a Site Reliability Engineer (SRE) within the Data Platform area, you will have the unique opportunity to manage services and infrastructures that are part of one of the largest data platforms in the world. Your role will involve ensuring that the data, services, and infrastructures are reliable, fault-tolerant, efficiently scalable, and cost-effective. You will also engage in the entire lifecycle of service management, from inception and design through deployment, operation, and refinement. This position allows you to design, build, and deliver various systems as a software engineer, contributing to the overall success of TikTok's data initiatives.

Your responsibilities will include maintaining services once they are live by measuring and monitoring availability, latency, and overall system health. You will practice sustainable incident response and conduct blameless postmortems to improve system reliability. Additionally, you will establish best engineering practices for both technical and non-technical team members, ensuring that the systems you design and implement are reliable, scalable, robust, and extensible, supporting TikTok's core products and business objectives.

Responsibilities

  • Engage in and improve the whole service lifecycle, from inception and design through deployment, operation, and refinement.
  • Ensure reliable, fault-tolerant, efficiently scalable and cost-effective data, services and infrastructures.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Practice sustainable incident response and blameless postmortems.
  • Establish best engineering practices for engineers as well as non-technical team members.
  • Design and implement reliable, scalable, robust and extensible big data systems that support core products and business.

Requirements

  • BS or MS degree in Computer Science or related technical field or equivalent practical experience.
  • Experience with Big Data technologies (Hadoop, MapReduce, Hive, Spark, Metastore, Presto, Flume, Kafka, ClickHouse, Flink, etc.).
  • Experience performing data analysis, data ingestion, and data integration.
  • Solid communication and collaboration skills.

Benefits

  • 100% premium coverage for employee medical insurance
  • Approximately 75% premium coverage for dependents
  • Health Savings Account (HSA) with company match
  • Dental insurance
  • Vision insurance
  • Short- and Long-Term Disability insurance
  • Basic Life insurance
  • Voluntary Life insurance
  • AD&D insurance
  • Flexible Spending Account (FSA) options for Health Care, Limited Purpose, and Dependent Care
  • 10 paid holidays per year
  • 17 days of Paid Personal Time Off (PPTO)
  • 10 paid sick days per year
  • 12 weeks of paid Parental leave
  • 8 weeks of paid Supplemental Disability
  • Mental and emotional health benefits through EAP and Lyra
  • 401(k) company match
  • Gym reimbursement
  • Cellphone service reimbursement