Tiktok - Seattle, WA
posted 4 days ago
TikTok is the leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S., created to enhance focus and governance on our data protection policies and content assurance protocols to ensure the safety of U.S. users. The teams within USDS are dedicated to providing oversight and protection of the TikTok platform and U.S. user data, allowing millions of Americans to continue using TikTok for learning, earning, self-expression, and entertainment. The Global E-commerce Site Reliability Engineer (SRE) team works closely with engineering and product teams to build and maintain large-scale, globally distributed, observable, and fault-tolerant systems. As an SRE, you will take ownership of production systems and be responsible for observability and automation across complex service mesh architectures. In this role, you will own the service level of a critical, revenue-generating E-commerce platform, focusing on service reliability, scalable design, and release management in a cloud-native environment. You will define service level indicators and data-driven objectives to maintain and improve uptime, latency, and system health of a core TikTok production platform. Collaboration with engineering and product teams is essential to ensure that key requirements, such as capacity planning and launch reviews, are met to enable transparent service delivery to customers. You will also engage in automation efforts aimed at infrastructure-as-code, scalability, and service resiliency, while implementing SRE practices around incident management and post-mortems, participating in on-call rotations as needed.