Tiktok - Seattle, WA
posted 3 months ago
TikTok is the leading destination for short-form mobile video, with a mission to inspire creativity and bring joy to over 1 billion users globally. Our infrastructure team is responsible for operating a large network of Points of Presence (POPs) worldwide, hosting edge services such as traffic acceleration, CDN cache, and gaming. We are looking for a Senior Site Reliability Engineer to join our team, focusing on building a Kubernetes-based platform (PaaS) that manages the lifecycle of edge services across our globally distributed infrastructure. This role combines software and systems engineering to ensure that our infrastructure services are reliable, fault-tolerant, efficiently scalable, and cost-effective. As a Senior Site Reliability Engineer, you will have the opportunity to manage complex systems at scale, including hyperscale datacenters, public cloud environments, global content distribution networks (CDNs), and load balancers that handle terabits of traffic. You will be responsible for building, expanding, and operating Bytedance's global infrastructures, which include large-scale systems in both public and private clouds, data centers, and content delivery networks. Your role will involve creating tools, automations, visualizations, and monitors to facilitate the operation and optimization of our global infrastructure. You will work in a fast-paced environment, participating in technical operations and rotations to address performance and reliability issues, and help improve the entire lifecycle of infrastructure services from inception and design through development, deployment, user support, and refinement.