Manager, SRE

$70,000 - $90,000/Yr

Dematic - Grand Rapids, MI

posted 6 months ago

Full-time - Manager
Grand Rapids, MI
Machinery Manufacturing

About the position

We are looking for a dynamic, motivated hands-on SRE leader. If you are a hands-on individual contributor and motivated to move into a leadership role, we would like to speak with you. This is an exciting opportunity to join the Digital Solutions R&D team within Dematic. You'll work with a great team to enhance our digital software that hundreds of customers around the world use to manage buildings, staff, renewable energy infrastructure, and industrial machines used for material handling and warehouse automation. You will be growing and leading the distributed SRE team and will collaborate closely with DevOps, MLOps, engineering, security, and IT teams to build monitoring, observability, and alerting policies and standards. You will be responsible for establishing and maturing SRE best practices and building processes and tooling to monitor SRE metrics aligned with industry best practices. You will champion continuous improvement within your team. You will work with the live operations team to provide the support needed for new and existing customer projects.

Responsibilities

  • Develop, measure, and evolve SRE core capabilities following industry best practices.
  • Lead Dematic SRE community supporting projects with design, planning, and implementation of automation solutions and capabilities around continuous integration and delivery.
  • Own end-to-end responsibility for understanding, implementing, and maintaining SRE automation CI/CD pipelines.
  • Collaborate with DevOps, MLOps, engineering, architecture board, security, cloud governance, and other teams to help mature the adoption of SRE principles and processes for Dematic products.
  • Develop and evolve SRE tools, processes, and talent, collaborating closely with the infrastructure team, DevOps software development, security, and external providers.
  • Lead Incident Response and Root Cause Analysis.
  • Mentor, coach, and evaluate direct reports' performance and provide constructive feedback.
  • Assist in live site support and incident resolution.

Requirements

  • Bachelor's or Master's degree in Computer Science with 8+ years of software engineering experience.
  • 3+ years of experience leading distributed SRE engineering teams.
  • 5+ years of experience leading teams with CI/CD and IaC automation projects and initiatives.
  • Excellent troubleshooter spanning systems, networks, and code, utilizing a systematic problem-solving approach.
  • Proven track record decreasing MTTR (Mean-Time-To-Recovery), increasing MTTF (Mean-Time-To-Failure), and improving overall service quality.
  • 3+ years of experience leading Incident Response and root cause analysis (RCA).
  • Deep understanding of Terraform, Kubernetes, Docker, Serverless technologies, configuration management tools, etc.
  • 3+ years of experience in deploying and operating SaaS applications and cloud infrastructure (GCP or equivalent & On-Premise virtualized environments).
  • 5+ years of experience with CI/CD and DevOps tools like Jenkins, JFrog Artifactory, Gitlab, GCP DevOps, Artifactory, etc.
  • 3+ years of experience with at least one of the major cloud providers, GCP, Azure, AWS; GCP preferred.

Benefits

  • Career Development
  • Competitive Compensation and Benefits
  • Pay Transparency
  • Global Opportunities
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service