Tiktok - Mountain View, CA
posted 3 days ago
TikTok is the leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S., created to enhance focus and governance on our data protection policies and content assurance protocols to ensure the safety of U.S. users. The teams within USDS are dedicated to providing oversight and protection of the TikTok platform and U.S. user data, allowing millions of Americans to continue using TikTok for learning, earning, self-expression, and entertainment. The Global E-commerce Site Reliability Engineer (SRE) team works closely with engineering and product teams to build and maintain large-scale, globally distributed, observable, and fault-tolerant systems. As an SRE, you will take ownership of production systems and be responsible for observability and automation across complex service mesh architectures. In this role, you will own the service level of a critical, revenue-generating E-commerce platform, focusing on service reliability, scalable design, and release management in a cloud-native environment. You will define service level indicators and data-driven objectives to improve uptime, latency, and system health of a core TikTok production platform. Collaboration with engineering and product teams is essential to ensure that key requirements such as capacity planning and launch reviews are performed to enable transparent service delivery to customers. Automation will be a key focus, aimed at infrastructure-as-code, scalability, and service resiliency. You will also implement SRE practices around incident management and post-mortems while participating in on-call rotations.