Tiktok - Seattle, WA

posted 3 months ago

Full-time - Mid Level
Seattle, WA
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

TikTok is the leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a newly established subsidiary of TikTok in the U.S., created to enhance focus and governance over our data protection policies and content assurance protocols to ensure the safety of U.S. users. Our commitment is to provide oversight and protection of the TikTok platform and U.S. user data, allowing millions of Americans to continue using TikTok for learning, earning, creative expression, and entertainment. The teams within USDS that contribute to this mission include Trust & Safety, Security & Privacy, Engineering, User & Product Operations, Corporate Functions, and more. Joining TikTok means being part of a team that values creativity and collaboration. We believe that every challenge is an opportunity for learning, innovation, and growth. Our hybrid work model requires employees to work in the office three days a week, fostering collaboration and cross-functional partnerships. This model is regularly reviewed, and specific requirements may change as needed. In this role, you will engage in and improve the entire lifecycle of Recommendation systems, from system design consulting to launch reviews, deployment, operation, and refinement. You will deliver tools and software to enhance the reliability and scalability of services, automate operations, and improve R&D efficiency. Additionally, you will be responsible for building the availability of large-scale services deployed across global data centers, planning and managing cloud resource utilization, and ensuring the SLA of large-scale clusters. Monitoring service health, latency, and availability will also be key components of your responsibilities, along with practicing sustainable incident response and conducting postmortems.

Responsibilities

  • Engage in and improve the whole lifecycle of Recommendation systems from system design consulting through to launch reviews, deployment, operation, and refinement.
  • Deliver tools/software to improve the reliability and scalability of services, automate operations, and improve R&D efficiency.
  • Build availability of large-scale services deployed across global data centers.
  • Plan, manage, and optimize cloud resources utilization, ensuring SLA of large-scale clusters.
  • Measure and monitor availability, latency, and overall service health.
  • Practice sustainable incident response and postmortems.

Requirements

  • Bachelor's degree or above majoring in Computer Science or related fields, with at least 2 years of related work experience.
  • Experience in SRE of large-scale systems deployment with high reliability and scalability.
  • Familiar with system operation skills in Linux and network.
  • Experience programming in at least one of the following languages: Python, Perl, Go, or C/C++.
  • Experience in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Familiar with popular CI/CD procedures and environments.
  • Effective communication skills and a sense of ownership and drive.

Benefits

  • Inclusive workplace culture
  • Reasonable accommodations for candidates with disabilities or other protected reasons
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service