McDonald's - Chicago, IL

posted about 2 months ago

Full-time - Mid Level
Remote - Chicago, IL
Food Services and Drinking Places

About the position

The Senior Manager of Site Reliability Engineering (SRE) at McDonald's is responsible for leading the technical strategy and vision for reliability practices within the mobile application domain. This role involves overseeing a team of site reliability engineers, ensuring the development and implementation of effective reliability engineering practices, and collaborating with various engineering and product teams to enhance operational reliability and efficiency.

Responsibilities

  • Oversee design decisions and guide the team to achieve key results for assigned products.
  • Resolve issues using engineering principles and build proactive design solutions for potential failures.
  • Lead a team of site reliability engineers in developing a continuous reliability approach and implementing crucial SRE practices.
  • Design and drive monitoring, alerting, and ticket-reporting strategies that measure SLA, SLO, MTTI, and MTTR and align with management expectations, in order to minimize production downtime.
  • Guide site reliability automation to eliminate manual toil and build self-healing capabilities.
  • Participate in the selection of appropriate automation tools and define technology, quality, experience, and implementation standards within the technical domain.

Requirements

  • Bachelor's Degree in Technology or a related field, or equivalent experience.
  • 5+ years of professional experience as a Site Reliability Engineer, including at least 3 years in a managerial role.
  • Experience with SRE design focused on reliability and resiliency.
  • Experience working in a cloud environment.
  • Experience in mobile development on iOS and Android.
  • Strong experience with DevOps, including CI/CD pipelines with Jenkins or similar tools, and Git/GitHub.
  • Proven skills in high availability and scalability design, as well as performance monitoring and testing.
  • Comfortable with production environments, firewalls, and networking.
  • Expertise in deploying, observing, alerting, logging, and monitoring systems (e.g., New Relic, Datadog) with a focus on predictive analysis.
  • Strong interpersonal and written communication skills.