AIML - ML Engineer, Machine Learning Platform & Infra

Apple - Seattle, WA

posted 26 days ago

Full-time - Mid Level

Computer and Electronic Product Manufacturing

About the position

The AIML - ML Engineer position focuses on optimizing machine learning models and infrastructure within Apple's Foundation Model Infrastructure team. The role involves collaborating with research and product teams to develop production-grade solutions that serve millions of customers in real time and improve the performance of various Apple services. The engineer will work on cutting-edge model architectures, analyze inference bottlenecks, and mentor other engineers in the organization.

Responsibilities

Optimize inference for cutting-edge model architectures in collaboration with the Foundation Model Research team.
Build production-grade solutions to launch models serving millions of customers in real time.
Develop tools to identify and understand bottlenecks in inference across different hardware and use cases.
Mentor and guide engineers within the organization.

Requirements

5+ years of experience leading and driving complex, ambiguous projects.
Experience with high-throughput services, particularly at supercomputing scale.
Proficiency in running applications on cloud platforms (AWS, Azure, or equivalent) using Kubernetes and Docker.
Familiarity with GPU programming concepts using CUDA.
Experience with popular ML frameworks such as PyTorch or TensorFlow.

Nice-to-haves

Proficiency in building and maintaining systems using modern programming languages (e.g., Golang, Python).
Familiarity with fundamental deep learning architectures such as Transformers and Encoder/Decoder models.
Experience with NVIDIA TensorRT-LLM, vLLM, DeepSpeed, and NVIDIA Triton Inference Server.
Experience writing custom GPU kernels using CUDA or OpenAI Triton.
