Senior SRE Engineer

$107,000 - $153,000/Yr

RingCentral - Seattle, WA

posted 5 months ago

Full-time - Senior
Hybrid - Seattle, WA
501-1,000 employees
Professional, Scientific, and Technical Services

About the position

RingCentral is seeking a Senior Site Reliability Engineer to join our Infrastructure team, which is responsible for providing robust Docker infrastructure for both LAB environments and Production services across our Data Centers and Amazon Cloud. This role is pivotal in ensuring the reliability and efficiency of our distributed and high-load systems. As a Senior Site Reliability Engineer, you will engage in the design and implementation of Kubernetes-based solutions, focusing on automation for deployments of clusters in both cloud and on-premise facilities. You will also contribute to the development of RingCentral's proprietary infrastructure solutions, including DNS, package and image repositories, mail relays, and NTP, while building app packages for our tools deployments. Your responsibilities will encompass the full lifecycle of our services, including development, modification, deployment, and support. You will provide ongoing support for our production environment, which involves delivering changes, troubleshooting issues, performing root cause analysis, and facilitating blameless postmortems. Additionally, you will research and develop solutions for Docker infrastructure, manage DNS systems, RPM repositories, and OS customization, and work with Infrastructure as Code (IaaC) using Terraform. Your role will also involve actively contributing to production stability improvements through monitoring, process enhancements, and automation, as well as developing infrastructure solutions and working with CI/CD processes. Collaboration is key in this position, as you will communicate and work closely with adjacent teams such as Operations, QA, and Development. Continuous improvement and knowledge sharing will be essential, along with maintaining up-to-date documentation. This role requires a strong understanding of SRE principles and DevOps practices, as well as proficiency in Linux and popular open-source applications. You will be expected to automate system administration tasks using configuration management tools and have a good command of scripting languages like Bash or Python. Familiarity with CI/CD tools and application monitoring techniques will also be crucial for success in this role.

Responsibilities

  • Provide ongoing support for production environment in terms of delivering changes, troubleshoot, manage issues, perform root cause analysis, facilitate blameless postmortems
  • Research and develop solutions for Docker (Kubernetes, Spinnaker) infrastructure in Data Centers and Amazon
  • Manage DNS systems, RPM repositories, OS customization, and GIT
  • Work on Infrastructure as Code (IaaC) with Terraform
  • Actively contribute to production stability improvements (monitoring, process improvements, automation)
  • Develop infrastructure solutions
  • Work with CI/CD processes - use and create
  • Design automated software and product upgrades, change management, and release management solutions
  • Communicate and collaborate with adjacent teams (Operations, QA, Development)
  • Continuously improve and share knowledge of system, update documentation

Requirements

  • FedRAMP (Must be a US Citizen)
  • Bachelor's degree in Computer Science, Computer Engineering or related field and equivalent practical experience
  • Knowledge and experience in production support activities
  • Understanding of SRE principles and DevOps practices
  • Good knowledge of Linux and popular open-source applications
  • Fundamental understanding of TCP/IP networks
  • Expertise automating system administration tasks with configuration management tools
  • Good knowledge of one of the scripting languages (bash, Python)
  • Knowledge of CI/CD tools and ability to implement deployment activities automation
  • Familiar with application and service monitoring tools and techniques

Nice-to-haves

  • Go Development experience
  • Ability to read and troubleshoot Python/Go code
  • Familiarity and practical experience with virtualization/cloud technologies
  • Familiarity with Networking protocols such as BGP
  • Experience in ELK usage, Grafana
  • Experience with Docker infrastructure deployment from scratch

Benefits

  • Comprehensive medical, dental, vision, disability, life insurance
  • Health Savings Account (HSA), Flexible Spending Account (FSAs) and Commuter benefits
  • 401K match and ESPP
  • Paid time off and paid sick leave
  • Wellness programs including 1:1 coaching and meditation guidance
  • Paid parental and pregnancy leave and new parent gift boxes
  • Family-forming benefits (IVF, Preservation, Adoption etc.)
  • Emergency backup care (Child/Adult/Pets)
  • Pet insurance and Pet Telehealth
  • Employee Assistance Program (EAP) with counseling sessions available 24/7
  • Free legal services that provide legal advice, document creation and estate planning
  • Employee bonus referral program
  • Student loan refinancing assistance
  • Employee perks and discounts program
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service