Apple - Cupertino, CA

posted about 1 month ago

Full-time - Mid Level
Cupertino, CA
Computer and Electronic Product Manufacturing

About the position

The Foundation Model Infrastructure team at Apple is a critical component of the Machine Learning Platform Technologies organization, responsible for building frameworks, services, and tools that support Apple's foundation models. This role involves optimizing large-scale language, vision, and speech models to enhance the performance of various Apple services, including Apple Search, Apple Music, and Siri, ultimately impacting billions of users worldwide.

Responsibilities

  • Optimize inference for cutting-edge model architectures in collaboration with the Foundation Model Research team.
  • Build production-grade solutions to launch models serving millions of customers in real-time.
  • Develop tools to identify bottlenecks in inference across different hardware and use cases.
  • Mentor and guide engineers within the organization.

Requirements

  • 5+ years of experience leading and driving complex, ambiguous projects.
  • Experience with high throughput services, particularly at supercomputing scale.
  • Proficient in running applications on Cloud platforms (AWS / Azure or equivalent) using Kubernetes and Docker.
  • Familiarity with GPU programming concepts using CUDA.
  • Experience with popular ML frameworks like Pytorch or Tensorflow.

Nice-to-haves

  • Proficient in building and maintaining systems using modern programming languages (e.g., Golang, Python).
  • Familiarity with fundamental Deep Learning architectures such as Transformers and Encoder/Decoder models.
  • Experience with Nvidia TensorRT-LLM, vLLLM, DeepSpeed, and Nvidia Triton Server.
  • Experience writing custom CUDA kernels using CUDA or OpenAI Triton.

Benefits

  • Comprehensive medical and dental coverage
  • Retirement benefits
  • Discounted products and free services
  • Reimbursement for certain educational expenses, including tuition
  • Discretionary bonuses or commission payments
  • Relocation assistance
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service