This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

Zscaler • Posted 27 days ago
$161,000 - $230,000/Yr
Full-time • Senior
San Jose, CA

About the position

Zscaler is looking for an experienced Principal Site Reliability Engineer to join our Engineering Team, reporting to the VP of Engineering. This is a hybrid role that requires working from our San Jose, CA office 3 days a week. In this role, you will work with large-scale distributed systems, cloud platforms (AWS, GCP, Azure), and infrastructure as code. You will support large-scale services, manage high-pressure situations, and participate in on-call rotations. You will develop and enhance tools for large-scale service technologies, ensuring high standards in system design and code quality. You will also diagnose and fix issues by editing code, modifying infrastructure configurations, conducting network and performance analysis, and creating reusable tooling. Finally, you will develop automation tools and optimize services through version-controlled infrastructure as code.

Responsibilities

  • Work with large-scale distributed systems, cloud platforms (AWS, GCP, Azure), and infrastructure as code.
  • Support large-scale services, manage high-pressure situations, and participate in on-call rotations.
  • Develop and enhance tools for large-scale service technologies, ensuring high standards in system design and code quality.
  • Diagnose and fix issues by editing code, modifying infrastructure configurations, conducting network and performance analysis, and creating reusable tooling.
  • Develop automation tools and optimize services through version-controlled infrastructure-as-code.

Requirements

  • 8-10+ years of relevant experience working in SRE teams, supporting mission-critical production services.
  • Deep understanding of SRE principles, practices, and tools.
  • Experience with large-scale distributed systems, cloud platforms (AWS, GCP, Azure), and infrastructure as code.
  • Experience with incident response, including resolving system failures and outages, with a focus on engineering solutions that support production reliability.

Nice-to-haves

  • Bachelor's Degree in Computer Science, Management Information Systems, or equivalent experience.
  • Proficiency in Python, Golang, Java, or Rust. Experience working in a standard SDLC.

Benefits

  • Various health plans
  • Time off plans for vacation and sick time
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks, and more!
© 2024 Teal Labs, Inc