Tiktok - San Jose, CA

posted 3 days ago

Full-time - Mid Level
San Jose, CA
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

As a Site Reliability Engineer Tech Lead at TikTok, you will play a crucial role in ensuring the reliability and performance of our services. TikTok is a leading platform for short-form mobile video, and our mission is to inspire creativity and bring joy to users around the world. Our Site Reliability Engineering (SRE) team combines software and systems engineering to build and maintain large-scale, distributed, and fault-tolerant systems. You will have the opportunity to tackle complex challenges related to system scale while leveraging your expertise in coding, algorithms, complexity analysis, and large-scale system design. In this position, you will engage in and enhance the entire lifecycle of services, from inception and design through development, capacity planning, launch reviews, deployment, operation, and refinement. You will design and implement software platforms and monitoring frameworks that facilitate efficient, automated, and intelligent service-oriented architecture (SOA) governance. Your role will also involve scaling systems sustainably through automation and driving improvements in system reliability, efficiency, and velocity. Additionally, you will practice sustainable user support, incident response, and conduct blameless postmortems to foster a culture of continuous improvement. This position requires a strong technical background, excellent problem-solving skills, and the ability to communicate effectively with team members and stakeholders. At TikTok, we value creativity and collaboration, and we are committed to creating an inclusive environment where diverse voices are celebrated.

Responsibilities

  • Engage in and improve the whole lifecycle of services from inception and design, throughout development, capacity planning, and launch reviews, to deployment, operation, and refinement.
  • Design and implement software platforms and monitor frameworks for efficient, automated, and intelligent service-oriented architecture (SOA) governance.
  • Scale systems sustainably through mechanisms such as automation; evolve systems reliability, efficiency, and velocity by pushing for changes.
  • Practice sustainable user support, incident response, and blameless postmortems.

Requirements

  • Bachelor's degree in Computer Science or a related technical field with 5 years of experience.
  • Experience programming in one of the languages: C, C++, Java, Python, Go, and Rust.
  • Familiar with Unix/Linux system internals, networking, and distributed systems.
  • Preferred experience in designing and analyzing large-scale distributed systems.
  • Preferred strong skills in problem solving and communication.

Benefits

  • 100% premium coverage for employee medical insurance
  • Approximately 75% premium coverage for dependents
  • Health Savings Account (HSA) with company match
  • Dental, Vision, Short/Long term Disability, Basic Life, Voluntary Life and AD&D insurance plans
  • Flexible Spending Account (FSA) Options like Health Care, Limited Purpose and Dependent Care
  • 10 paid holidays per year
  • 17 days of Paid Personal Time Off (PPTO)
  • 10 paid sick days per year
  • 12 weeks of paid Parental leave
  • 8 weeks of paid Supplemental Disability
  • Mental and emotional health benefits through EAP and Lyra
  • 401K company match
  • Gym and cellphone service reimbursements
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service