Apple - Cupertino, CA

posted 15 days ago

Full-time - Senior
Cupertino, CA
Computer and Electronic Product Manufacturing

About the position

The Senior Software Engineer for the Machine Learning Platform and Infrastructure at Apple is responsible for designing and building cloud-native infrastructure platforms that support AI/ML products, including Siri and Search. This role involves architecting scalable systems, leading the development of distributed systems, and collaborating with various teams to ensure seamless integration of platform solutions. The engineer will also mentor junior engineers and drive the adoption of cloud-native technologies across the organization.

Responsibilities

  • Architect and develop scalable cloud-native platforms to support the deployment and operation of Apple's AI/ML services and applications.
  • Lead the design and implementation of distributed systems that are resilient, secure, and optimized for high performance.
  • Collaborate with AI/ML teams and other infrastructure engineers to ensure seamless integration of platform solutions into machine learning workflows.
  • Drive the adoption of cloud-native technologies across the organization, influencing architectural decisions and best practices.
  • Troubleshoot complex infrastructure issues, ensuring optimal performance and reliability across multi-cloud environments.
  • Mentor junior engineers and contribute to the team's overall growth and technical excellence.

Requirements

  • B.S., M.S., or Ph.D. in Computer Science, Computer Engineering, or equivalent practical experience.
  • 7+ years of programming experience in languages such as Golang, Python, Java, Rust, or C/C++.
  • Proven track record of building large-scale, complex, cloud-based distributed systems on Kubernetes; identifying shortcomings and optimization opportunities; and making data-driven cost-performance tradeoffs to influence design decisions.
  • Experience managing cloud environments (AWS, GCP) using infrastructure as code (IaC).
  • Experience building and operating infrastructure and tools that empower developers/product teams to deliver business value.
  • Experience leading strong technical teams that balance operational excellence, collaboration and user needs.

Nice-to-haves

  • 10+ years of experience in cloud infrastructure and distributed systems engineering.
  • Strong proficiency in Go or Rust, with experience building automation tools and distributed systems.
  • Proficiency in Kubernetes controller development and automating cluster management.
  • Strong knowledge of CI/CD frameworks and DevOps practices.
  • Familiarity with monitoring and observability tools to ensure high availability and performance.
  • Strong problem-solving skills, including the ability to debug issues that span heterogeneous compute environments.
  • Excellent communication and collaboration skills, with a proven ability to work across engineering teams and influence architectural decisions.