CCC Intelligent Solutions - Chicago, IL
posted 3 months ago
As an Azure-Based Site Reliability Engineer (SRE) at CCC Intelligent Solutions, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based applications and services hosted primarily on Microsoft Azure. This position is designed for individuals who are passionate about cloud technologies and have a strong background in site reliability engineering. You will collaborate closely with development teams to design, build, and maintain the observability and alerting components of our services. Your experience with Azure services and multi-tenant SQL-based applications will be instrumental in optimizing our cloud architecture and driving continuous improvement in our systems. In this role, you will help build an SRE culture by sharing best practices, approaches, documentation, and code with other engineering teams across the organization. You will be responsible for designing, implementing, and managing the alerting and monitoring strategy for Azure-based services. Monitoring system performance and reliability will be a key part of your responsibilities, as you will implement monitoring solutions and alerts to ensure proactive responses to potential issues. You will also collaborate with development teams to optimize Azure-based applications based on our observability strategy. Your approach to operational issues will be rooted in a software development mindset, utilizing defined feedback loops within the software delivery lifecycle. You will perform root cause analysis for incidents and implement preventive measures to minimize future disruptions. Staying updated with Azure technologies and best practices will be essential, as you will recommend and implement improvements to enhance application and system performance and efficiency. Additionally, you will participate in on-call rotations and respond to incidents as needed, ensuring timely resolution and communication. Coaching other team members to ensure systems are supported by following SRE best practices will also be part of your role.