TikTok - Seattle, WA
posted 3 months ago
TikTok is the leading destination for short-form mobile video, with a mission to inspire creativity and bring joy to over 1 billion users globally. Our Infrastructure Engineering team supports the company's rapid growth by building and operating hyper-scale datacenters, managing the lifecycle of server fleets, providing cloud solutions, and developing scalable, reliable infrastructure services.

As a Site Reliability Engineer (SRE), you will combine software and systems engineering to build and run large-scale, massively distributed infrastructure. Your primary responsibility will be to ensure that our infrastructure services are reliable, fault-tolerant, efficiently scalable, and cost-effective. You will manage a variety of complex systems at scale, including those that administer hyper-scale datacenters, public cloud, global content delivery networks (CDNs), and load balancers that handle terabits of traffic.

In this role, you will build, expand, and operate ByteDance's global infrastructure, which includes large-scale systems in public and private clouds, datacenters, and CDNs. You will also build tools, automation, visualizations, and monitors to facilitate the operation and optimization of that infrastructure. Working in a fast-paced environment, you will participate in technical operations and rotations in response to performance and reliability issues. Additionally, you will help improve the entire lifecycle of infrastructure services, from inception and design through development, deployment, user support, and refinement.