Robert Half - Maumee, OH

posted 11 days ago

Full-time
Maumee, OH
Administrative and Support Services

About the position

The Site Reliability Engineer will play a crucial role in enhancing system reliability and automation processes, delivering operational insights through analytics. This position involves close collaboration with DevOps and application development teams to ensure the highest levels of availability, reliability, and security of products and services.

Responsibilities

  • Design and implement highly available, scalable, and fault-tolerant infrastructure.
  • Collaborate with engineering teams to define and implement reliability standards and best practices.
  • Automate infrastructure provisioning, configuration, and deployment processes to streamline operations.
  • Work with software engineers to design and implement deployment strategies using automated continuous integration and delivery pipelines.
  • Monitor system performance and proactively identify potential issues to ensure uptime and optimal performance.
  • Collaborate with software engineering teams to improve system reliability through automated testing, fault tolerance, and disaster recovery planning.
  • Lead incident management efforts, overseeing response processes and coordinating with cross-functional teams.
  • Design and implement incident response playbooks and escalation procedures for timely and effective resolution.
  • Conduct post-incident reviews to identify root causes and implement preventative measures.
  • Develop and implement robust observability solutions to gain deeper insights into system performance.

Requirements

  • Proficient in Continuous Integration / Continuous Delivery (CICD)
  • Strong knowledge of Python programming language
  • Experience with Infrastructure as Code
  • Familiarity with Computer Security Incident Response Team operations
  • Solid understanding of Disaster Recovery strategies
  • Proficiency in using Ansible for configuration management
  • Experience with Splunk for log management and analysis
  • Ability to use Grafana for data visualization
  • Practical knowledge of Terraform for infrastructure management
  • Understanding of DevOps methodologies
  • Experience in DevOps Engineering and using DevOps Tools
  • Ability to work collaboratively in a team and independently
  • Excellent problem-solving skills and attention to detail
  • Strong verbal and written communication skills
  • Bachelor's degree in Computer Science or a related field, or equivalent work experience
  • Relevant industry certifications would be a plus.

Benefits

  • Medical insurance
  • Vision insurance
  • Dental insurance
  • Life insurance
  • Disability insurance
  • 401(k) plan
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service