Tiktok - San Jose, CA
posted 3 days ago
As a Site Reliability Engineer Tech Lead at TikTok, you will play a crucial role in ensuring the reliability and performance of our services. TikTok is a leading platform for short-form mobile video, and our mission is to inspire creativity and bring joy to users around the world. Our Site Reliability Engineering (SRE) team combines software and systems engineering to build and maintain large-scale, distributed, and fault-tolerant systems. You will have the opportunity to tackle complex challenges related to system scale while leveraging your expertise in coding, algorithms, complexity analysis, and large-scale system design. In this position, you will engage in and enhance the entire lifecycle of services, from inception and design through development, capacity planning, launch reviews, deployment, operation, and refinement. You will design and implement software platforms and monitoring frameworks that facilitate efficient, automated, and intelligent service-oriented architecture (SOA) governance. Your role will also involve scaling systems sustainably through automation and driving improvements in system reliability, efficiency, and velocity. Additionally, you will practice sustainable user support, incident response, and conduct blameless postmortems to foster a culture of continuous improvement. This position requires a strong technical background, excellent problem-solving skills, and the ability to communicate effectively with team members and stakeholders. At TikTok, we value creativity and collaboration, and we are committed to creating an inclusive environment where diverse voices are celebrated.