Metro Systems - Charlotte, NC

posted 16 days ago

Full-time
Charlotte, NC
Transit and Ground Passenger Transportation

About the position

The Site Reliability Engineer (SRE) role at Tandym Group involves supporting a financial client in Charlotte by ensuring the stability and performance of production environments. The SRE will monitor system health, provide on-call support, and drive automation efforts to enhance operational efficiency. This position emphasizes proactive management of applications and services, focusing on observability and incident response to facilitate rapid feature development.

Responsibilities

  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Support the applications with OnCall rotation support
  • Provide stability to applications and facilitate rapid feature development by taking active control of the service
  • Automate and eliminate manual work and look for opportunities for automation
  • Maintain and implement SLO implementation adoption and automation
  • Conduct Production Readiness/Health Scoring & Error Budget Tracking
  • Maintain and update runbook standards

Requirements

  • Experience using DevOps tools and technologies such as GitLab
  • Experience with Infrastructure as Code tools such as Terraform
  • Strong troubleshooting skills and ability to enhance observability using monitoring tools
  • Proactive approach to Observability maturity, identifying problems and performance bottlenecks
  • Experience leading incident response and supporting application teams
  • Ability to conduct blameless postmortems and provide developer feedback for enhanced logging and addressing technical debt
  • Experience with monitoring tools such as Dynatrace & Splunk
  • Experience in public cloud platforms, preferably AWS
  • Experience developing APIs, Microservices, or Frontend is a plus
  • Experience using source version control (SVC) such as Git

Nice-to-haves

  • AWS certification
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service