Unclassified - Memphis, TN

posted 3 months ago

Full-time
Remote - Memphis, TN

About the position

The Site Reliability Engineer (SRE) position is a critical role within our technology team, focusing on ensuring the reliability, performance, and scalability of our systems. The SRE will be responsible for creating and managing build pipelines using tools such as Bamboo and GitLab, as well as troubleshooting any build and deployment failures that may arise. This role requires a proactive approach to release management, where the engineer will work closely with development teams to solve complex performance and scaling issues, ensuring that we can meet traffic demands effectively, especially during peak times driven by organic growth and marketing events. In addition to managing CI/CD pipelines and production releases, the SRE will develop and maintain release architectures and monitoring frameworks that support the product team. This includes working within ServiceNow to create requests and changes, assisting developers through Jira tickets, and analyzing application logs to identify errors and performance bottlenecks. Documentation of processes and procedures is also a key responsibility, as is championing improvements and driving efficiencies through teamwork. The SRE will be expected to work with minimal direction, effectively identify priorities, and communicate barriers to ensure smooth operations. This role is part of a 24/7 production support team, requiring the engineer to be proactive and ready to influence change as needed. The ideal candidate will have a strong background in technology, particularly in CI/CD practices, Kubernetes, and scripting, and will be comfortable collaborating with multiple teams to resolve technical issues and implement solutions.

Responsibilities

  • Create/manage Build pipelines in Bamboo and Gitlab
  • Troubleshooting of build and deployment failures
  • Release Management
  • Solve sophisticated performance and scaling issues
  • Build and support CI/CD pipelines and the production releases
  • Develop and maintain release architectures and monitor frameworks
  • Work within ServiceNow to create requests and changes
  • Assisting developers as requested through Jira tickets
  • Analyze application logs for errors and performance bottlenecks
  • Documentation of processes and procedures
  • Champion improvements and drive efficiencies through teamwork
  • Patch creation and deployment
  • Work with minimal direction
  • Identify priorities and communication barriers effectively
  • Influence change
  • Be proactive and be part of 24/7 production support

Requirements

  • 5+ years of professional experience in technology or a related field
  • 2+ years CI/CD pipeline experience
  • 2+ years experience with Kubernetes/EKS; pod life cycle experience such as readiness checks/liveness checks
  • 2+ years of Intermediate to Advanced skills in BASH shell scripting
  • 2+ years with Dynatrace APM and RUM (other APM or RUM may be applicable) - Dynatrace Associate Certification nice to have
  • 2+ years intermediate skills with on-prem GitLab CI pipeline creation, troubleshooting, and configuration of GitLab CI
  • Education: Bachelor's Degree
  • Certification: Dynatrace Associate Certification

Nice-to-haves

  • Working knowledge of complex CDN cached website architecture
  • Familiarity with reading and understanding JavaScript (Node.JS)
  • Strong experience in application and web servers like Tomcat, Apache etc.
  • Experience in Service Now and Jira for change management
  • Experience working within DevOps team
  • Effective communication with client on issues and opportunities for improvement
  • Ability to collaborate with multiple teams to resolve technical issues and fix problems
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service