TikTok - Mountain View, CA

Posted 2 months ago

Full-time - Mid Level
Hybrid - Mountain View, CA
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

The Site Reliability Engineer (SRE) role in TikTok's U.S. Data Security (USDS) division focuses on operating and maintaining security infrastructure and platforms to protect user data and ensure system reliability. The position offers the opportunity to work on security initiatives and collaborate with cross-functional teams to improve the security and performance of the TikTok platform. The SRE will deploy scalable systems, improve automation, and respond to incidents, working in a hybrid environment.

Responsibilities

  • Work with infrastructure, product and platform engineering teams on operating and deploying software platforms, capacity planning, and launch reviews throughout the lifecycle of services.
  • Maintain sustainable reliability and scalability of software systems by improving automation to measure and monitor availability, latency, and overall system health.
  • Consistently evolve systems by pushing for changes that improve system reliability and release velocity.
  • Practice sustainable incident response and postmortems.

Requirements

  • BS degree in Computer Science, Computer Engineering, Electrical Engineering, or a related major, with 2+ years of work experience.
  • Programming, debugging, and optimization skills in general-purpose programming languages such as Go, Python, C/C++, Rust, or Java.
  • Experience in working with Unix/Linux systems from kernel to shell and beyond.
  • Experience in analyzing and debugging production issues at scale.
  • Experience with and understanding of infrastructure-as-code concepts, approaches, methods, and tooling.

Nice-to-haves

  • Hands-on experience with large cloud providers such as AWS, Azure, or GCP.
  • Experience coding infrastructure with tools such as Kubernetes, Terraform, Ansible, Puppet, Chef, or SaltStack.
  • Experience securing infrastructure in distributed systems through automation, or practicing chaos engineering.
  • Experience with web application development, Unix/Linux environments, distributed and parallel systems, developing large software systems, mobile application development, and/or security software development.

Benefits

  • 100% premium coverage for employee medical insurance, approximately 75% premium coverage for dependents, and a Health Savings Account (HSA) with a company match.
  • Dental, Vision, Short/Long-Term Disability, Basic Life, Voluntary Life, and AD&D insurance plans.
  • 10 paid holidays per year plus 17 days of Paid Personal Time Off (PPTO) and 10 paid sick days per year.
  • 12 weeks of paid Parental leave and 8 weeks of paid Supplemental Disability.
  • Mental and emotional health benefits through EAP and Lyra.
  • 401(k) company match, plus gym and cellphone service reimbursements.