Apple - Seattle, WA

posted 26 days ago

Full-time - Mid Level
Seattle, WA
Computer and Electronic Product Manufacturing

About the position

The AIML - ML Engineer position focuses on optimizing machine learning models and infrastructure within Apple's Foundation Model Infrastructure team. This role involves collaborating with research and product teams to develop production-grade solutions that serve millions of customers in real time, enhancing the performance of various Apple services. The engineer will work on cutting-edge model architectures, analyze bottlenecks in inference, and mentor other engineers in the organization.

Responsibilities

  • Optimize inference for cutting-edge model architectures in collaboration with the Foundation Model Research team.
  • Build production-grade solutions to launch models serving millions of customers in real time.
  • Develop tools to identify and understand bottlenecks in inference across different hardware and use cases.
  • Mentor and guide engineers within the organization.

Requirements

  • 5+ years of experience leading and driving complex, ambiguous projects.
  • Experience with high-throughput services, particularly at supercomputing scale.
  • Proficiency in running applications on cloud platforms (AWS, Azure, or equivalent) using Kubernetes and Docker.
  • Familiarity with GPU programming concepts using CUDA.
  • Experience with popular ML frameworks such as PyTorch or TensorFlow.

Nice-to-haves

  • Proficiency in building and maintaining systems using modern programming languages (e.g., Golang, Python).
  • Familiarity with fundamental deep learning architectures such as Transformers and Encoder/Decoder models.
  • Experience with NVIDIA TensorRT-LLM, vLLM, DeepSpeed, and NVIDIA Triton Inference Server.
  • Experience writing custom GPU kernels using CUDA or OpenAI Triton.