The position involves implementing tooling to monitor AWS EKS-based systems with a focus on performance, reliability, and scalability. The role requires ensuring that architecture and deployment models are sufficient to support SLA commitments and are well prepared for future problems of scale. The candidate will leverage cloud technology and platform capabilities to provide operationally sustainable solutions that are robust and cost-effective. Additionally, applying software engineering best practices to comprehensively address and resolve problems is essential. The role also includes collaborating with product support teams to drive efficiency and enhance customer experience through self-service tools and automation, ensuring timely response to incidents and support requests, and conducting root cause analysis to implement preventative measures to minimize toil and impact on customers. Leading and participating in incident retrospectives to enhance future response efforts and participating in on-call rotations to provide critical support as needed are also key responsibilities.