Mueller Water Productsposted 5 days ago
Full-time • Senior
Atlanta, GA
Fabricated Metal Product Manufacturing

About the position

We are currently searching for a Senior Site Reliability Engineer to join Mueller's Smart Water Infrastructure team. This role will be based in our Atlanta, GA on a hybrid office/ remote schedule. The Senior Site Reliability Engineer (SRE) is responsible for deployment, monitoring and ensuring the availability, reliability, scalability, and performance of software products against operational targets. They are responsible for the design, implementation, and maintenance of infrastructure required to support software products.

Responsibilities

  • Collaborate with software development teams to ensure that services are designed with availability, security, scalability, reliability, and performance in mind from the outset.
  • Monitor and manage live production environments, identifying and resolving issues as they arise and implementing long-term solutions to prevent their recurrence.
  • Develop and maintain automation tools for system health, performance monitoring, and incident response to ensure rapid detection and resolution of issues.
  • Resolve support issues where your experience is required to ascertain the issue quickly and to find an appropriate resolution.
  • Lead root cause analysis of critical outages, contributing to a culture of learning and continuous improvement.
  • Provide SRE/DevOps/Infrastructure services and guidance to the Software Team.
  • Support vendor-unmanaged services such as databases.
  • Co-ordinate with internal and external security and penetration tests and manage the prioritization and resolution of any findings.
  • Produce well-written documentation and architecture diagrams.
  • Be available 'out of hours' if required to complete specific tasks and support customers in emergency or disaster scenarios. This is not a usual and regular occurrence.
  • Mentor junior engineers, fostering a culture of technical excellence and collaborative problem-solving.

Requirements

  • Bachelor's or Master's degree in a computing or scientific/engineering discipline, or equivalent demonstrable experience.
  • 5+ years of Site Reliability Engineer experience.
  • Operational experience of AWS Serverless technologies
  • Linux and Windows system administration
  • CI/CD pipelines
  • Database Administration
  • Patch Management and Disaster and Recovery
  • Advanced Monitoring knowledge.
  • Automation scripting in a mainstream programming language
  • IaC

Nice-to-haves

  • Git
  • Monitoring tools (Datadog, Cloudwatch, Grafana).
  • Terraform
  • Coding experience outside scripting tools.
  • Networking understanding (DNS/Firewalls/Certificates).
  • Exposure to ISO certified environments.
  • Security fundamentals. Snyk, TFSec and other security tools.

Job Keywords

Hard Skills
  • AWS Serverless
  • Datadog
  • Firewall
  • Git
  • Infrastructure As A Service
  • 2vajqhcs
  • 4xPIYO
  • 5Zs6FXCou uO90R7FW812UGEb
  • 6ENgH Y7xbsy oHYkV58UW
  • 8PG4JOIztS
  • aBRVF7
  • astWB x7jFyX DGtXQyPBY
  • AVeSH4l7 gm34HxN 2uPJM5G3sxkCo4e
  • EydehmGX9 5bK86aumU LYDUR74693E
  • KfAptzVyT qSr GF 9O7EUK4h
  • nHqkU1e4zLo Q5nUjm4EZW
  • oBfJHVA1P xIiab13D9
  • pvMR2lDLOjHA SQkLVAPW8BEf
  • Qi3Sqh9K XzafTS3
  • rUxI0Gi Ux2oAR0h
  • ShqzP RwPAicCpFSD
  • xSWp2jMJH ajSQcpJw
  • z8nNteREAHZf 1NhDbFlcwX
  • ZdrxOzW xm9fGRQLpEWJgF JgcBVhm
Soft Skills
  • kBqRPpbZ q5WROVcN
Build your resume with AI

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service