Mastercard - O'Fallon, MO
posted 3 months ago
The Lead Site Reliability Engineer (SRE) position at Mastercard is a pivotal role within the Enterprise Data Accessibility BizOps team, aimed at enhancing the reliability and efficiency of large-scale, distributed services and infrastructures. The SRE will leverage their expertise in software and systems engineering to build and manage cloud operations, CI/CD pipelines, and automation best practices. This role is essential for ensuring that the services and infrastructures are not only reliable and fault-tolerant but also scalable and cost-effective. The SRE will be responsible for overseeing the production environment, ensuring operational readiness, and collaborating closely with developers to implement technology services that meet operational criteria such as system availability, performance, and deployment automation. In this role, the SRE will engage in a variety of tasks including defining strategies for application performance monitoring, managing incident responses, and maintaining services post-launch by monitoring system health and availability. The SRE will also be involved in continuous optimization efforts within the production environment, ensuring that the systems are resilient and capable of handling the demands placed upon them. A significant aspect of the role involves practicing sustainable incident response and conducting blameless postmortems to foster a culture of learning and improvement. The SRE will work with a global team, requiring effective communication and collaboration across different time zones. This position not only demands technical expertise but also a systematic problem-solving approach, strong communication skills, and a proactive mindset to drive improvements in customer experience and operational efficiency. The SRE will play a crucial role in the DevOps transformation at Mastercard, advocating for change and standardization across development, quality, release, and product organizations, ultimately aligning product priorities with operational needs.