This job is closed

Principal Site Reliability Engineer

Zscaler • posted 27 days ago

$161,000 - $230,000/Yr

Full-time • Senior

San Jose, CA

About the position

Zscaler is looking for an experienced Principal Site Reliability Engineer to join our Engineering Team, reporting to the VP of Engineering. This is a hybrid role that requires working from our San Jose, CA office 3 days a week. In this role, you will work with large-scale distributed systems, cloud platforms (AWS, GCP, Azure), and infrastructure as code. You will support large-scale services, manage high-pressure situations, and participate in on-call rotations. You will develop and enhance tools for large-scale service technologies, maintaining high standards in system design and code quality. You will also diagnose and fix issues by editing code, modifying infrastructure configurations, conducting network and performance analysis, and creating reusable tooling. Finally, you will develop automation tools and optimize services through version-controlled infrastructure as code.

Responsibilities

Work with large-scale distributed systems, cloud platforms (AWS, GCP, Azure), and infrastructure as code.
Support large-scale services, manage high-pressure situations, and participate in on-call rotations.
Develop and enhance tools for large-scale service technologies, maintaining high standards in system design and code quality.
Diagnose and fix issues by editing code, modifying infrastructure configurations, conducting network and performance analysis, and creating reusable tooling.
Develop automation tools and optimize services through version-controlled infrastructure as code.

Requirements

8-10+ years of relevant experience working in SRE teams supporting mission-critical production services.
Deep understanding of SRE principles, practices, and tools.
Experience with large-scale distributed systems, cloud platforms (AWS, GCP, Azure), and infrastructure as code.
Experience with incident response, including resolving system failures and outages, with a focus on engineering solutions that support production reliability.

Nice-to-haves

Bachelor's Degree in Computer Science, Management Information Systems, or equivalent experience.
Proficiency in Python, Golang, Java, or Rust. Experience working in a standard SDLC.

Benefits

Various health plans
Time off plans for vacation and sick time
Parental leave options
Retirement options
Education reimbursement
In-office perks, and more!