This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

Zscalerposted 28 days ago
$161,000 - $230,000/Yr
Full-time • Senior
Hybrid • San Jose, CA
Professional, Scientific, and Technical Services
Resume Match Score

About the position

Zscaler is looking for an experienced Principal Site Reliability Engineer to join our Engineering Team, reporting to the VP of Engineering. This is a hybrid role going into our San Jose, CA office 3 days a week. In this role, you will work with large scale distributed systems, cloud platforms (AWS, GCP, Azure) and infrastructure as code. You will support large-scale services, manage high-pressure situations, and participate in on-call rotations. Additionally, you will develop and enhance tools for large scale services technologies, ensuring high standards in system design and code quality. Your responsibilities will also include diagnosing and fixing issues by editing code, modifying infrastructure configurations, conducting network and performance analysis, and creating reusable tooling. You will develop automation tools and optimize services through version-controlled infrastructure-as-code.

Responsibilities

  • Work with large scale distributed systems, cloud platforms (AWS, GCP, Azure) and infrastructure as code.
  • Support large-scale services, manage high-pressure situations, and participate in on-call rotations.
  • Develop and enhance tools for large scale services technologies, ensuring high standards in system design and code quality.
  • Diagnose and fix issues by editing code, modifying infrastructure configurations, conducting network and performance analysis and creating reusable tooling.
  • Develop automation tools and optimize services through version-controlled infrastructure-as-code.

Requirements

  • 8-10+ years of relevant experience working in SRE teams, supporting mission critical production service.
  • Deep understanding of SRE principles, practices, and tools.
  • Experience with large scale distributed systems, cloud platforms (AWS, GCP, Azure) and infrastructure as code.
  • Experience with incident response including resolving system failures and outages, with a focus on engineering solutions in support of production reliability.

Nice-to-haves

  • Bachelor's Degree in Computer Science, Management Information Systems, or equivalent experience.
  • Proficiency in Python, Golang, Java, or Rust. Experience working in a standard SDLC.

Benefits

  • Various health plans
  • Time off plans for vacation and sick time
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks, and more!

Job Keywords

Hard Skills
  • Infrastructure as Code
  • Java
  • Python
  • Rust
  • Version Control
  • 68lYTnHtNPzq W3HuYtQIFM8
  • 6tLpMRsWF X1idQvZGI
  • 7wOGiMP6v 5qXabIDu
  • IaFQWsrU 2trAiTgP
  • SENM78ZvTL1m BbvuAgt7ZoTm
  • sO9R30WAKiyl 8LeKAIHqO
  • swK7kr3M fnYpsRLB
  • x70gSqe Ax0O75
Build your resume with AI

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service