This job is closed


Pattern, posted 20 days ago
Pune, IN
Professional, Scientific, and Technical Services

About the position

The Site Reliability Architect (SRA) is responsible for designing and implementing scalable, reliable, and efficient systems that support the organization's software applications and services. As a key technical leader, you will work closely with development, operations, and product teams to ensure that systems are designed with reliability, performance, and scalability in mind. You will also play a crucial role in establishing best practices for site reliability engineering (SRE) and fostering a culture of operational excellence.

Responsibilities

  • Design and implement robust, scalable, and high-availability systems that meet business and technical requirements.
  • Collaborate with software engineering teams to integrate reliability into the software development lifecycle, ensuring that applications are built with operational excellence in mind.
  • Develop and maintain service level objectives (SLOs), service level agreements (SLAs), and service level indicators (SLIs) to measure system performance and reliability.
  • Lead incident response efforts, including post-mortem analysis and root cause investigations, to improve system reliability and prevent future incidents.
  • Automate operational processes to improve efficiency and reduce manual intervention, leveraging tools and technologies such as Infrastructure as Code (IaC).
  • Monitor system performance and reliability using appropriate metrics and monitoring tools, proactively identifying and addressing potential issues.
  • Advocate for and implement best practices in site reliability engineering, including capacity planning, disaster recovery, and incident management.
  • Train and mentor engineering and operations teams on SRE principles and practices, fostering a culture of continuous improvement.

Requirements

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • 8+ years of experience in software engineering, systems engineering, or site reliability engineering.
  • Strong understanding of cloud computing platforms (e.g., AWS, Azure, Google Cloud) and of container and orchestration technologies (e.g., Docker, Kubernetes).
  • Experience with configuration management and automation tools (e.g., Terraform, Ansible, Puppet).
  • Proficient in programming and scripting languages (e.g., Python, Go, Bash) for automation and tool development.
  • Extensive knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) and practices.
  • Solid understanding of networking concepts, distributed systems, and microservices architecture.
  • Excellent problem-solving skills and the ability to work effectively under pressure.

Nice-to-haves

  • Leadership Skills: Ability to lead cross-functional teams and drive initiatives that enhance system reliability and performance.
  • Interpersonal Skills: Self-motivated team player who builds trust, is action- and results-oriented, has an open and collaborative style, and is comfortable working in a dynamic environment.
  • Communication Skills: Strong written, oral, and presentation skills, with the ability to effectively communicate technical concepts to non-technical stakeholders.
  • Attention to Detail: Thoroughness in accomplishing tasks, ensuring accuracy and quality in all aspects of work.
  • Analytical Skills: Strong analytical and troubleshooting skills, with the ability to think critically and make data-driven decisions.

Job Keywords

Hard Skills
  • Ansible
  • Bash
  • Docker
  • Go
  • Kubernetes
© 2024 Teal Labs, Inc