Intuit - Atlanta, GA

posted 2 months ago

Full-time - Senior
Atlanta, GA
10,001+ employees
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

Mailchimp is a leading marketing platform for small businesses, empowering millions of customers worldwide to build their brands and grow their companies with a suite of marketing automation, multichannel campaigns, CRM, and analytics tools. We are seeking an Engineering Leader to lead our Site Reliability Engineering team. In this pivotal role, you will be responsible for ensuring the reliability, scalability, and performance of our application, which is used by both internal engineers and external customers. You will collaborate with cross-functional teams to design, implement, and maintain robust, resilient systems, and you will drive a cultural shift toward operational excellence across the organization.

We are looking for experienced leaders with a deep technical background, grounded in years of hands-on development of high-scale, highly available systems that have achieved outstanding levels of operational excellence. The ideal candidate will have applied those lessons at scale within their organization.

As part of Intuit Mailchimp, you will work in a hybrid workplace, collaborating in person with team members in our Atlanta, GA or New York, NY offices two or more days per week.

Responsibilities

  • Drive a mindset of operational excellence across the Mailchimp Engineering organization
  • Design and implement strategies for site reliability operations, including automation, monitoring, and maintenance processes
  • Coach and develop engineers responsible for site reliability and performance
  • Stay up to date with industry trends and emerging technologies to drive continuous improvement
  • Coordinate with cross-functional teams, including engineering, operations, support, and product, to ensure the reliability and consistency of our services
  • Collaborate with other operational excellence teams across Intuit on shared best practices and learnings
  • Provide technical guidance and mentorship to team members and stakeholders

Requirements

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field, or equivalent experience
  • 8+ years of experience in Site Reliability Management, with 3+ years in a management role
  • Proven track record of managing teams of engineers and developing strategies for site reliability and performance operations
  • Excellent communication skills and ability to lead cross-functional teams and stakeholders
  • Proactive and results-driven attitude, with a passion for building reliable, scalable, and performant systems
  • Proficiency in programming languages such as PHP, Go, Python, and Java
  • Strong understanding of Linux/Unix systems and network protocols
  • Experience with cloud platforms such as AWS and/or Google Cloud
  • Expertise in containerization and orchestration technologies like Docker and Kubernetes
  • Proficiency with monitoring and observability tools (e.g., Prometheus, Grafana, Splunk)
  • Experience with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI, GitHub Actions)
  • Knowledge of SQL database management systems (e.g., MySQL) and caching technologies
  • Familiarity with infrastructure as code (IaC) and configuration management tools (e.g., Terraform, Puppet)