Evolent Health - Carson City, NV
posted about 2 months ago
As an Associate Site Reliability Engineer at Evolent, you will play a crucial role in managing our extensive application suite and cloud infrastructure. This position is part of the Platform Engineering organization, where you will help transform how we manage cloud infrastructure and application reliability. Your contributions will directly impact our ability to provide high-quality care to individuals with complex health conditions. We are looking for someone who is eager to join a talented team and is passionate about improving application reliability and performance.

In this role, you will take ownership of identifying and implementing solutions for recurring application problems in order to increase application reliability. You will execute corrective actions identified during post-incident reviews (PIRs) or root cause analyses (RCAs) and participate in incident management, including after-hours support. You will also maintain observability solutions that gather and analyze system metrics from production systems, and you will use Application Performance Management (APM) to identify and resolve performance bottlenecks.

Automation will be a key focus of your work: you will automate tasks to improve efficiency and reduce manual effort. Collaboration is vital in this role, as you will work closely with Application Engineering teams and other Site Reliability Engineers (SREs) to ensure the reliability and scalability of our systems. You will also have the opportunity to learn and use tools such as Terraform and Ansible for provisioning and managing infrastructure.