Unclassified - Memphis, TN

posted 3 months ago

Full-time
Remote - Memphis, TN

About the position

The Site Reliability Engineer (SRE) position is a critical role that focuses on ensuring the reliability, availability, and performance of our systems and services. The SRE will be responsible for creating and managing build pipelines using tools such as Bamboo and GitLab, as well as troubleshooting any build and deployment failures that may arise. This role involves release management, where the engineer will work closely with development teams to solve complex performance and scaling issues, ensuring that we can meet traffic demands effectively, especially during peak times driven by organic growth and marketing events. In addition to managing CI/CD pipelines and production releases, the SRE will develop and maintain release architectures and monitoring frameworks that support the product team. This includes enhancing process flows for delivery and working within ServiceNow to create requests and changes as needed. The engineer will also assist developers through Jira tickets, analyze application logs for errors and performance bottlenecks, and document processes and procedures to ensure clarity and efficiency in operations. The SRE will champion improvements and drive efficiencies through teamwork, patch creation, and deployment, all while working with minimal direction. Effective communication is key, as the engineer will need to identify priorities and communication barriers, influence change, and be proactive in providing 24/7 production support. This role requires a strong technical background and the ability to work collaboratively across multiple teams to resolve technical issues and implement solutions.

Responsibilities

  • Create/manage Build pipelines in Bamboo and GitLab
  • Troubleshooting of build and deployment failures
  • Release Management
  • Solve sophisticated performance and scaling issues
  • Build and support CI/CD pipelines and production releases
  • Develop and maintain release architectures and monitor frameworks
  • Work within ServiceNow to create requests and changes
  • Assisting developers as requested through Jira tickets
  • Analyze application logs for errors and performance bottlenecks
  • Documentation of processes and procedures
  • Champion improvements and drive efficiencies through teamwork
  • Patch creation and deployment
  • Work with minimal direction
  • Identify priorities and communication barriers effectively
  • Influence change
  • Be proactive and be part of 24/7 production support

Requirements

  • 5+ years of professional experience in technology or a related field
  • 2+ years CI/CD pipeline experience
  • 2+ years experience with Kubernetes/EKS; pod life cycle experience such as readiness checks/liveness checks
  • 2+ years of Intermediate to Advanced skills in BASH shell scripting
  • 2+ years with Dynatrace APM and RUM (other APM or RUM may be applicable) - Dynatrace Associate Certification nice to have
  • 2+ years intermediate skills with on-prem GitLab CI pipeline creation, troubleshooting, and configuration of GitLab CI

Nice-to-haves

  • Working knowledge of complex CDN cached website architecture
  • Familiarity with reading and understanding JavaScript (Node.JS)
  • Strong experience in application and web servers like Tomcat, Apache etc.
  • Experience in Service Now and Jira for change management
  • Experience working within DevOps team
  • Effective communication with client on issues and opportunities for improvement
  • Ability to collaborate with multiple teams to resolve technical issues and fix problems
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service