Mueller Water Productsposted 9 days ago
Full-time • Senior
Atlanta, GA
Fabricated Metal Product Manufacturing

About the position

We are currently searching for a Senior Site Reliability Engineer to join Mueller's Smart Water Infrastructure team. This role will be based in our Atlanta, GA on a hybrid office/ remote schedule. The Senior Site Reliability Engineer (SRE) is responsible for deployment, monitoring and ensuring the availability, reliability, scalability, and performance of software products against operational targets. They are responsible for the design, implementation, and maintenance of infrastructure required to support software products.

Responsibilities

  • Collaborate with software development teams to ensure that services are designed with availability, security, scalability, reliability, and performance in mind from the outset.
  • Monitor and manage live production environments, identifying and resolving issues as they arise and implementing long-term solutions to prevent their recurrence.
  • Develop and maintain automation tools for system health, performance monitoring, and incident response to ensure rapid detection and resolution of issues.
  • Resolve support issues where your experience is required to ascertain the issue quickly and to find an appropriate resolution.
  • Lead root cause analysis of critical outages, contributing to a culture of learning and continuous improvement.
  • Provide SRE/DevOps/Infrastructure services and guidance to the Software Team.
  • Support vendor-unmanaged services such as databases.
  • Co-ordinate with internal and external security and penetration tests and manage the prioritization and resolution of any findings.
  • Produce well-written documentation and architecture diagrams.
  • Be available 'out of hours' if required to complete specific tasks and support customers in emergency or disaster scenarios. This is not a usual and regular occurrence.
  • Mentor junior engineers, fostering a culture of technical excellence and collaborative problem-solving.

Requirements

  • Bachelor's or Master's degree in a computing or scientific/engineering discipline, or equivalent demonstrable experience.
  • 5+ years of Site Reliability Engineer experience.
  • Operational experience of AWS Serverless technologies
  • Linux and Windows system administration
  • CI/CD pipelines
  • Database Administration
  • Patch Management and Disaster and Recovery
  • Advanced Monitoring knowledge.
  • Automation scripting in a mainstream programming language
  • IaC

Nice-to-haves

  • Git
  • Monitoring tools (Datadog, Cloudwatch, Grafana).
  • Terraform
  • Coding experience outside scripting tools.
  • Networking understanding (DNS/Firewalls/Certificates).
  • Exposure to ISO certified environments.
  • Security fundamentals. Snyk, TFSec and other security tools.

Job Keywords

Hard Skills
  • AWS Serverless
  • Datadog
  • Firewall
  • Git
  • Infrastructure As A Service
  • 2k1pHxnz jN8ODBF
  • 4nVaeD65 CN6YKuB UQi0jDC5vPZNMEV
  • 8PEYxs1vp FkzY9ENIBGjKmeP
  • IqhDA59Gt yJcZRGvg
  • LEoYD XyZLI3hElHk
  • O3Ugzm
  • R67bgO
  • raqvCYwtlO
  • SbNq9IXVz TKq OM gljpHSfr
  • Ti5ar3E 9decKCnh
  • TVvBlhNdH Q2ScJa6wu
  • UbhyX0ZmAxqC HkxvE4Th95Bi
  • UcqOTJd6G eYiUbnju9 H8OaXSxC0To
  • VF3lN8HzW5m spuIJbrVwo
  • VJA3XEt MrLpTVfzigyujU w9LrAUI
  • x37kPM8TtJ6Q 1oKj6v8qwa
  • XA6y1 isuNZh TILwZlzrj
  • YMxAU1hj
  • ZMx7P SN8LYT OAz2YbSfu
Soft Skills
  • pIZ0xqWl ZYkls0xK
Build your resume with AI

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service