Evolent Health - Raleigh, NC
posted about 2 months ago
As an Associate Site Reliability Engineer at Evolent, you will play a crucial role in managing our extensive application suite and cloud infrastructure. This position is part of the Platform Engineering organization, where you will be instrumental in transforming how we manage cloud infrastructure and application reliability. Your contributions will directly impact our ability to provide high-quality care to our clients by ensuring that our systems are reliable and efficient. You will be joining a highly talented team that values collaboration and innovation, and you will have the opportunity to work on exciting projects that enhance our operational capabilities. In this role, you will be responsible for identifying and implementing solutions for recurring application problems, which is essential for increasing application reliability. You will execute corrective actions identified during post-incident reviews and root cause analyses, ensuring that we learn from our experiences and continuously improve our processes. Your participation in incident management and after-hours support will be vital in maintaining the integrity of our systems. You will also maintain observability solutions to gather and analyze system metrics, helping to identify performance bottlenecks and resolve them effectively. Automation will be a key focus of your work, as you will be tasked with automating tasks to improve efficiency and reduce manual effort. Collaboration with Application Engineering teams and other Site Reliability Engineers will be essential to ensure the reliability and scalability of our systems. Additionally, you will have the opportunity to learn and use tools like Terraform and Ansible to provision and manage our infrastructure, further enhancing your skill set and contributing to the team's success.