TikTok - Seattle, WA

posted 8 days ago

Full-time - Mid Level
Seattle, WA
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

As a Site Reliability Engineer (SRE) in the Data Platform team at TikTok U.S. Data Security (USDS), you will play a crucial role in managing and ensuring the reliability of one of the largest data platforms in the world. Your primary responsibility will be to maintain the services and infrastructures that support TikTok's mission of inspiring creativity and bringing joy to users. This involves ensuring that the data, services, and infrastructures are reliable, fault-tolerant, efficiently scalable, and cost-effective.

You will have the opportunity to design, build, and deliver various systems as a software engineer, contributing to the overall performance and efficiency of the platform. In this role, you will establish solid design principles and best engineering practices for both technical and non-technical team members. You will be tasked with designing and building data transformations that are efficient and reliable for various purposes, including reporting, growth analysis, and multi-dimensional analysis. Additionally, you will implement robust and extensible big data systems that support core products and business operations.

The SRE team is involved in multiple projects within the data infrastructure and product area, including developing a query engine for ad-hoc and interactive queries, creating logging and ingestion infrastructure for real-time data processing, and building an experimentation platform for A/B testing of product features. You will also work on a workflow management platform that automates the scheduling and monitoring of workflows, as well as various data products that enhance user behavior analysis and profiling. This position requires a collaborative approach, as you will work closely with cross-functional teams to drive impact and innovation.

Responsibilities

  • Ensure reliable, fault-tolerant, efficiently scalable and cost-effective data, services and infrastructures.
  • Establish solid design principles and best engineering practices for engineers and non-technical personnel.
  • Design and build data transformations efficiently and reliably for various purposes such as reporting and growth analysis.
  • Design and implement reliable, scalable, robust, and extensible big data systems that support core products and business operations.
  • Develop a query engine that supports ad-hoc and interactive queries and batch pipelines.
  • Create logging and ingestion infrastructure that handles large amounts of data in real time.
  • Build an experimentation platform that supports A/B testing for all product and feature launches.
  • Develop a workflow management platform that programmatically authors, schedules, and monitors workflows such as streaming and batch tasks.
  • Implement distributed caching solutions to accelerate compute processes.
  • Work on various data products such as data visualization tools, user behavior analysis tools, and user profiling tools.

Requirements

  • Bachelor's degree in Computer Science, a related technical field involving software or systems engineering, or equivalent practical experience.
  • Experience writing code in Java, Go, Python, or a similar language.
  • Experience with algorithms, data structures, complexity analysis, and software design.

Nice-to-haves

  • Knowledge of strategies for ingesting, modeling, processing, and persisting data, including ETL design and dimensional modeling.
  • Familiarity with running production-grade services at scale and understanding cloud-native technologies and networking.
  • Experience developing tools and APIs to reduce human interaction with systems and applications using various coding and scripting standards.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems (e.g., Hadoop, M/R, Hive, Spark, Metastore, Presto, Flume, Kafka, ClickHouse, Flink).
  • Systematic problem-solving approach, effective communication skills, and a strong sense of drive.

Benefits

  • 100% premium coverage for employee medical insurance and approximately 75% premium coverage for dependents.
  • Health Savings Account (HSA) with company match.
  • Dental, Vision, Short/Long-Term Disability, Basic Life, Voluntary Life, and AD&D insurance plans.
  • Flexible Spending Account (FSA) options for healthcare and dependent care.
  • 10 paid holidays per year plus 17 days of Paid Personal Time Off (PPTO) and 10 paid sick days per year.
  • 12 weeks of paid parental leave and 8 weeks of paid supplemental disability.
  • Mental and emotional health benefits through EAP and Lyra.
  • 401(k) company match, plus gym and cellphone service reimbursements.