Apple - Cupertino, CA
posted 4 months ago
As a Site Reliability Engineer (SRE) at Apple Services Engineering (ASE), you will play a pivotal role in ensuring the reliability and performance of the systems that support Apple’s services, including iCloud. This position is not just about maintaining systems; it’s about crafting experiences that millions of customers rely on daily. You will be part of a team that is responsible for the design, engineering, and operation of services that must scale globally and remain highly available. Your work will directly impact the quality of Apple Services, and you will be expected to bring your passion for engineering and problem-solving to the forefront.

In this role, you will lead data-driven roadmaps and quarterly planning for a subset of core services, focusing on reliability. You will oversee the entire software lifecycle for these services, which includes infrastructure setup, capacity planning, deployment, monitoring, architecture, and software implementation. Collaboration with development teams will be crucial as you work to ensure that the services not only meet but exceed customer expectations. The ideal candidate will thrive in a fast-paced, collaborative environment and will be driven by a desire to solve complex engineering problems.

Your responsibilities will include implementing SRE principles such as monitoring, alerting, and automation, as well as managing the lifecycle of global services from inception through deployment and operations. You will also be expected to participate in on-call service support, ensuring that any issues are addressed promptly and effectively. This is an opportunity to work at the intersection of software development and operations, making a significant impact on the reliability of Apple’s services.