Senior Site Reliability Engineer, Object Storage

Apple - Cupertino, CA

posted 5 months ago

Full-time - Senior

Cupertino, CA

Computer and Electronic Product Manufacturing

About the position

The Senior Site Reliability Engineer (SRE) for Object Storage at Apple Services Engineering (ASE) plays a crucial role in ensuring the reliability and performance of Apple's object store orchestration service. This position involves supporting and maintaining the service by measuring and monitoring its availability, latency, and overall system health. The SRE will develop, run, and support various SRE tools and applications, engaging in the entire lifecycle of services from inception through deployment, operations, and refinement. This includes analyzing logs and telemetry data, writing monitoring and automation code, and participating in on-call and release manager rotations. In this role, the engineer will provide technical expertise and troubleshooting during service-level impacting events, ensuring that any issues are resolved swiftly and effectively. The SRE will also participate in code reviews, contribute to internal infrastructure improvements, and enhance processes to optimize service delivery. Operating applications at scale across multiple geographically dispersed public and private clouds is a key responsibility, supporting Apple's mission-critical internal efforts. Collaboration with dependent teams and customers through clear communication is essential to ensure alignment and successful project execution.

Responsibilities

Support and maintain object store orchestration service measuring and monitoring availability, latency and overall system health.
Develop, run and support SRE tools and applications.
Engage in improving the whole lifecycle of services from inception through deployment, operations and refinement.
Analyze logs and telemetry data by writing monitoring and automation code.
Participate in on-call and release manager rotations.
Provide technical expertise and troubleshooting during service level impacting events.
Participate in code review, internal infrastructure improvements and process enhancements.
Operate our application at scale, across multiple geographically dispersed public and private clouds, to support Apple's mission critical internal efforts.
Collaborate with dependent teams and customers through clear communications.

Requirements

BS degree in computer science or equivalent field with 5+ years of experience
At least 5 years in a Site Reliability Engineering, DevOps or infrastructure focused role.

Nice-to-haves

Lower level understanding of the Linux Operating System, standard networking protocols, and components
Experience with containers and orchestration via Kubernetes in public / private clouds
Hands-on experience managing large numbers of diverse systems with configuration management, infrastructure provisioning tools or software delivery platforms (such as Terraform and Spinnaker)
Excellent troubleshooting and problem solving skills.

Benefits

Health insurance
Dental insurance
401k plan
Paid holidays
Flexible scheduling
Professional development opportunities

Senior Site Reliability Engineer, Object Storage

About the position

Responsibilities

Requirements

Nice-to-haves

Benefits

Tools

Career Hubs

Guides

Company