TikTok - New York, NY

posted 2 months ago

Full-time - Mid Level
Hybrid - New York, NY
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

The Site Reliability Engineer (SRE) role at TikTok U.S. Data Security (USDS) focuses on enhancing the reliability and scalability of software systems while ensuring the protection of U.S. user data. This position involves collaborating with various engineering teams to operate and deploy software platforms, improve automation, and respond to incidents effectively. The SRE will play a crucial role in developing innovative solutions to complex challenges in a fast-paced environment, contributing to TikTok's mission of inspiring creativity and bringing joy.

Responsibilities

  • Work with infrastructure, product and platform engineering teams on operating and deploying software platforms, capacity planning, and launch reviews throughout the lifecycle of services.
  • Maintain sustainable reliability and scalability of software systems by improving automation to measure and monitor availability, latency, and overall system health.
  • Consistently evolve systems by pushing for changes that improve system reliability and release velocity.
  • Practice sustainable incident response and postmortems.

Requirements

  • BS degree in Computer Science, Computer Engineering, Electrical Engineering, or a related major, with 2+ years of work experience.
  • Programming, debugging, and optimization skills in general-purpose programming languages such as Go, Python, C/C++, Rust, or Java.
  • Experience working with Unix/Linux systems from kernel to shell and beyond.
  • Experience in analyzing and debugging production issues at scale.
  • Experience with and understanding of infrastructure-as-code concepts, approaches, methods, and tooling.

Nice-to-haves

  • Hands-on experience with large cloud providers such as AWS, Azure, or GCP.
  • Experience coding infrastructure with tools such as Kubernetes, Terraform, Ansible, Puppet, Chef, or SaltStack.
  • Experience securing infrastructure in a distributed system with automation, or practicing chaos engineering.
  • Experience with web application development, Unix/Linux environments, distributed and parallel systems, developing large software systems, mobile application development, and/or security software development.

Benefits

  • 100% premium coverage for employee medical insurance, approximately 75% premium coverage for dependents, and a Health Savings Account (HSA) with a company match.
  • Dental, Vision, Short/Long-Term Disability, Basic Life, Voluntary Life, and AD&D insurance plans.
  • Flexible Spending Account (FSA) options for Health Care, Limited Purpose, and Dependent Care.
  • 10 paid holidays per year plus 17 days of Paid Personal Time Off (PPTO) and 10 paid sick days per year.
  • 12 weeks of paid Parental leave and 8 weeks of paid Supplemental Disability.
  • Mental and emotional health benefits through EAP and Lyra.
  • 401(k) company match, plus gym and cellphone service reimbursements.