We are seeking an experienced and strategic Site Reliability Engineer (SRE) to drive the stability, reliability, and observability of our mission-critical systems. This role is crucial to ensuring high availability, performance, and operational excellence for our services. The SRE will be responsible for designing and implementing robust reliability frameworks, overseeing system monitoring and incident response, and leading key initiatives to improve system performance. The role requires a strong leadership mindset that balances proactive risk mitigation with rapid incident response. The ideal candidate will work closely with engineering, operations, and leadership teams to define and uphold service-level objectives (SLOs) and optimize system resilience.