Insight Global - Bellevue, WA
posted 3 months ago
The Site Reliability Engineer (SRE) position at Insight Global in Bellevue, Washington, is designed for individuals with a strong background in software development and DevOps best practices. The ideal candidate will have over five years of experience in these areas, particularly within an Enterprise or Shared Service DevOps team. This role emphasizes the importance of automation, requiring proficiency in scripting languages such as Bash and Python. The SRE will be responsible for implementing and managing CI/CD pipelines using tools like Jenkins and GitLab CI/CD, ensuring smooth and efficient software delivery processes. In addition to automation, the role demands a solid understanding of infrastructure-as-code (IAC) principles, with hands-on experience using tools like Ansible and Terraform for infrastructure automation. Familiarity with Amazon Web Services (AWS) is also crucial, as the SRE will work extensively with cloud technologies. The candidate should possess strong problem-solving skills and a proactive approach to maintaining system health, troubleshooting complex issues, and responding to incidents. Experience in post-incident analysis and implementing preventive measures is essential to enhance system reliability and performance. The SRE will also be expected to work with observability tools, monitoring, and alerting systems to ensure that service level agreements (SLAs), service level objectives (SLOs), and service level indicators (SLIs) are met. A commitment to balancing reliability with continuous innovation and development is a key aspect of this role, as the SRE will contribute to creating a robust and scalable infrastructure that supports the company's growth and operational goals.