Tiktok - Seattle, WA
posted 3 months ago
TikTok is the leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. As part of the U.S. Data Security (USDS) team, the Site Reliability Engineer (SRE) will play a crucial role in ensuring the reliability and performance of our services. This position combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. The SRE will engage in and improve the entire lifecycle of services, from inception and design through development, capacity planning, launch reviews, deployment, operation, and refinement. The role requires designing and implementing software platforms and monitoring frameworks for efficient, automated, and intelligent service-oriented architecture (SOA) governance. The SRE will also be responsible for scaling systems sustainably through automation and evolving system reliability, efficiency, and velocity by advocating for necessary changes. Additionally, the position involves practicing sustainable user support, incident response, and conducting blameless postmortems to learn from incidents and improve future performance. At TikTok, we embrace a culture of diversity, intellectual curiosity, openness, and problem-solving. We encourage close collaboration while promoting self-direction. The ideal candidate will have a strong background in programming and systems engineering, with experience in managing complex challenges of scale. This role is essential in maintaining the integrity and performance of TikTok's services, ensuring that millions of users can continue to express themselves creatively and be entertained safely.