Mphasis - Atlanta, GA
posted 3 months ago
We are seeking a Senior Site Reliability Engineer (SRE) with more than 5 years of experience to join our team in Atlanta, GA. The ideal candidate will work closely with client IT development squads to implement reliability and performance best practices in the applications and services they support. The role requires a deep understanding of modern cloud-based and on-premises architecture, as well as experience designing systems for reliability.

The SRE will be responsible for implementing monitoring, logging, and operational automation to keep the services they build running reliably and easy to maintain. The role also calls for the ability to design, develop, and support tools and applications that maintain a reliable site environment, along with skills in performance measurement and tuning: monitoring, measuring, and optimizing system performance and network communication. Experience with AWS CI/CD pipelines is crucial, as the SRE will be expected to design, build, implement, and maintain CI/CD pipelines that automate the software delivery process.

The successful candidate will have a strong background in application development or SRE, with at least 2 years of experience in operations automation using tools such as Python or Ansible. Knowledge of reliability engineering theories and methodologies is essential. Familiarity with Sumo Logic and APM tools such as Dynatrace, New Relic, AppDynamics, or Datadog is preferred. A Bachelor's degree in Computer Science, Information Technology, or a related field is also preferred, and experience with airline applications and infrastructure technology is a plus.