RingCentral - Seattle, WA
posted 4 months ago
RingCentral is seeking a Senior Site Reliability Engineer to join our Infrastructure team, which is responsible for providing robust Docker infrastructure for both LAB environments and Production services across our Data Centers and Amazon Cloud. This role is pivotal in ensuring the reliability and efficiency of our distributed and high-load systems. As a Senior Site Reliability Engineer, you will engage in the design and implementation of Kubernetes-based solutions, focusing on automation for deployments of clusters in both cloud and on-premise facilities. You will also contribute to the development of RingCentral's proprietary infrastructure solutions, including DNS, package and image repositories, mail relays, and NTP, while building app packages for our tools deployments. Your responsibilities will encompass the full lifecycle of our services, including development, modification, deployment, and support. You will provide ongoing support for our production environment, which involves delivering changes, troubleshooting issues, performing root cause analysis, and facilitating blameless postmortems. Additionally, you will research and develop solutions for Docker infrastructure, manage DNS systems, RPM repositories, and OS customization, and work with Infrastructure as Code (IaaC) using Terraform. Your role will also involve actively contributing to production stability improvements through monitoring, process enhancements, and automation, as well as developing infrastructure solutions and working with CI/CD processes. Collaboration is key in this position, as you will communicate and work closely with adjacent teams such as Operations, QA, and Development. Continuous improvement and knowledge sharing will be essential, along with maintaining up-to-date documentation. This role requires a strong understanding of SRE principles and DevOps practices, as well as proficiency in Linux and popular open-source applications. You will be expected to automate system administration tasks using configuration management tools and have a good command of scripting languages like Bash or Python. Familiarity with CI/CD tools and application monitoring techniques will also be crucial for success in this role.