Appleposted 10 days ago
Austin, TX
Computer and Electronic Product Manufacturing

About the position

Join the Apple Service Engineering team as a Site Reliability Engineer and be part of something extraordinary. At Apple, your ideas have the power to shape the future of our products, services, and customer experiences. Bring your passion and dedication, and watch your vision become reality. As an SRE, you'll play a pivotal role in supporting and scaling cloud services for thousands of development and operations engineers. Our services demand uncompromising scalability, high availability, and seamless performance. This is a hands-on position where you'll establish SRE practices for our private/public cloud service, accelerating our ability to deliver thousands of applications reliably and consistently. If you're driven by designing, engineering, and running systems that make a real difference for our customers, Apple is the perfect place for you.

Responsibilities

  • Ensure systems are reliable, secure, and scalable.
  • Maintain constant uptime and seamless scalability.
  • Collaborate with developers and architects to design and implement solutions.
  • Implement and coordinate telemetry using tools like Splunk, Grafana, and Prometheus.
  • Develop automation scripts and tools using Python and GoLang.
  • Create clear alert handling procedures and runbooks.

Requirements

  • Experience with major public cloud providers and their cloud-native services.
  • Familiarity with infrastructure as code (IaC) tools like Terraform or Ansible.
  • Proficiency in Kubernetes for deploying and troubleshooting container-based applications.
  • Adherence to SRE principles including monitoring, alerting, and automation.
  • Expertise in analyzing and troubleshooting complex system issues.
  • Excellent interpersonal and communication skills.
  • Technical (Engineering or Computer Science) BS/MS degree or equivalent work experience.

Nice-to-haves

  • Operate, monitor, and prioritize tasks across all production and non-production environments.
  • Design, build, and implement innovative software solutions.
  • Automate service deployment and orchestration in the cloud environment.
  • Participate in capability planning, scale testing, and disaster recovery exercises.
  • Foster strong relationships with partner teams like engineering, QA, and program management.

Job Keywords

Hard Skills
  • Ansible
  • Kubernetes
  • Prometheus
  • Python
  • Terraform
  • 8toHG6l0K 2CpkdwHVc
  • a6tin4OgHG0v2zr aPu z73v1
  • BwdnHaZVl 8uhlgmxw
  • ErSTjLd
  • gHRWbCd 8KWOefYH
  • hgu4i18m7QUc u5DNcnrLMdCF
  • hvgQrCGVPzMK EPg1zxOp9J0Z
  • hZrNOfCFL GsW0RVa
  • NFZtJDnCj1 k6oXQuNTzJHW
  • nqOmyx AuLJiHnCE
  • rFQHUDvE 4CfDqM2Smnl
  • rIioA1nl87 VwZWnS1cM
  • ukMU6se MsDJCF
  • Xd21jszHAn eG7q vOkLqjdVnh
  • XElLz1 VYH3FRzZE
  • z9BKnQDL
Soft Skills
  • xpzX4jUHTMywh9BZ
Build your resume with AI

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service