Apple - Austin, TX

posted 3 months ago

Full-time - Manager
Austin, TX
Computer and Electronic Product Manufacturing

About the position

As a Site Reliability Engineering (SRE) Manager at Apple Service Engineering, you will play a pivotal role in supporting and scaling cloud services that cater to thousands of development and operations engineers. This position is not just about overseeing operations; it is a hands-on role that requires you to establish and implement SRE practices for a private cloud service. Your efforts will directly contribute to enhancing our ability to reliably and consistently deliver thousands of applications. You will lead a dedicated team responsible for maintaining the uptime of mission-critical cloud systems, ensuring they can scale seamlessly and support the introduction of new applications and services. In this role, you will be expected to collaborate closely with developers and architects to aid in the design and implementation of systems that improve stability, security, and scalability. Your leadership will be crucial in fostering a culture of excellence, quality, and attention to detail within your team. You will also be responsible for mentoring and growing your team, instilling a strong sense of ownership and a desire to understand the intricacies of the systems you manage. The ideal candidate will possess a passion for automation, a reluctance for manual processes, and a commitment to continuous improvement in all aspects of service delivery.

Responsibilities

  • Lead a team responsible for providing the platform for mission-critical cloud systems.
  • Establish SRE practices for a private cloud service.
  • Collaborate with developers and architects to aid in design and implementation.
  • Ensure constant uptime and seamless scaling of cloud services.
  • Support operations while improving stability, security, and scalability.
  • Mentor and grow the SRE team, fostering a culture of excellence and ownership.

Requirements

  • Experience with Cloud Computing technologies, particularly Kubernetes.
  • Experience with configuration management tools such as Puppet, Chef, or Ansible.
  • Strong background in managing enterprise services in a large-scale nix environment.
  • Desire to build, grow, and mentor a team.
  • Strong verbal and written communication skills.

Nice-to-haves

  • Skilled at working cross-functionally to achieve project success.
  • Strong philosophy of continuous improvement.
  • Ability to encourage and foster a culture of visibility and transparency across teams.
  • Experience troubleshooting issues across the entire software stack.
  • Experience operating large-scale multi-tenant infrastructure as a managed service.

Benefits

  • Health insurance coverage
  • 401k retirement savings plan
  • Paid holidays and vacation time
  • Professional development opportunities
  • Flexible scheduling options
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service