Apple - Cupertino, CA

posted 4 months ago

Full-time - Senior
Cupertino, CA
Computer and Electronic Product Manufacturing

About the position

The Senior Site Reliability Engineer (SRE) for Object Storage at Apple Services Engineering (ASE) plays a crucial role in ensuring the reliability and performance of Apple's object store orchestration service. This position involves supporting and maintaining the service by measuring and monitoring its availability, latency, and overall system health. The SRE will develop, run, and support various SRE tools and applications, engaging in the entire lifecycle of services from inception through deployment, operations, and refinement. This includes analyzing logs and telemetry data, writing monitoring and automation code, and participating in on-call and release manager rotations. In this role, the engineer will provide technical expertise and troubleshooting during service-level impacting events, ensuring that any issues are resolved swiftly and effectively. The SRE will also participate in code reviews, contribute to internal infrastructure improvements, and enhance processes to optimize service delivery. Operating applications at scale across multiple geographically dispersed public and private clouds is a key responsibility, supporting Apple's mission-critical internal efforts. Collaboration with dependent teams and customers through clear communication is essential to ensure alignment and successful project execution.

Responsibilities

  • Support and maintain object store orchestration service measuring and monitoring availability, latency and overall system health.
  • Develop, run and support SRE tools and applications.
  • Engage in improving the whole lifecycle of services from inception through deployment, operations and refinement.
  • Analyze logs and telemetry data by writing monitoring and automation code.
  • Participate in on-call and release manager rotations.
  • Provide technical expertise and troubleshooting during service level impacting events.
  • Participate in code review, internal infrastructure improvements and process enhancements.
  • Operate our application at scale, across multiple geographically dispersed public and private clouds, to support Apple's mission critical internal efforts.
  • Collaborate with dependent teams and customers through clear communications.

Requirements

  • BS degree in computer science or equivalent field with 5+ years of experience
  • At least 5 years in a Site Reliability Engineering, DevOps or infrastructure focused role.

Nice-to-haves

  • Lower level understanding of the Linux Operating System, standard networking protocols, and components
  • Experience with containers and orchestration via Kubernetes in public / private clouds
  • Hands-on experience managing large numbers of diverse systems with configuration management, infrastructure provisioning tools or software delivery platforms (such as Terraform and Spinnaker)
  • Excellent troubleshooting and problem solving skills.

Benefits

  • Health insurance
  • Dental insurance
  • 401k plan
  • Paid holidays
  • Flexible scheduling
  • Professional development opportunities
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service