Allegis Group - Atlanta, GA

posted 2 months ago

Full-time - Mid Level
Atlanta, GA
10,001+ employees
Administrative and Support Services

About the position

TEKsystems is seeking an experienced Site Reliability Engineer (SRE) to join our top client in Charlotte, NC. The ideal candidate will have a strong background in monitoring, maintaining, and building out an OpenShift platform. This role is critical in ensuring the reliability and performance of the systems, and the SRE will be responsible for handling support tickets while also focusing on reducing the volume of tickets through automation efforts on the platform. The SRE will work closely with development and operations teams to implement best practices in site reliability and will be instrumental in driving improvements in system performance and availability. The SRE will be expected to work on-site three days a week in Atlanta, GA, collaborating with team members and stakeholders to ensure seamless operations. The role requires a proactive approach to problem-solving and a commitment to continuous improvement. The successful candidate will leverage their expertise in cloud technologies, particularly Azure, and will utilize their skills in Linux and Python to automate processes and enhance system reliability. This position offers an exciting opportunity to work in a dynamic environment and contribute to the success of a leading organization.

Responsibilities

  • Monitor and maintain the OpenShift platform to ensure high availability and performance.
  • Handle support tickets and troubleshoot issues as they arise.
  • Implement automation solutions to reduce the volume of support tickets.
  • Collaborate with development and operations teams to improve site reliability practices.
  • Participate in incident response and post-mortem analysis to identify areas for improvement.
  • Develop and maintain documentation related to system configurations and processes.

Requirements

  • Proven experience as a Site Reliability Engineer or similar role.
  • Strong knowledge of OpenShift and cloud technologies, particularly Azure.
  • Proficiency in Linux operating systems and command-line tools.
  • Experience with automation tools and scripting languages, especially Python.
  • Ability to troubleshoot complex systems and provide effective solutions.

Nice-to-haves

  • Familiarity with container orchestration and management tools.
  • Experience with monitoring and logging tools.
  • Knowledge of networking concepts and protocols.

Benefits

  • Medical, dental & vision insurance
  • Critical Illness, Accident, and Hospital coverage
  • 401(k) Retirement Plan with pre-tax and Roth post-tax contributions
  • Voluntary Life Insurance & AD&D for employees and dependents
  • Short and long-term disability insurance
  • Health Spending Account (HSA)
  • Transportation benefits
  • Employee Assistance Program
  • Paid Time Off (PTO), Vacation, or Sick Leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service