About The Position

The Sr. Software Engineer in Machine Learning Efficiency will play a crucial role in optimizing the performance of distributed machine learning workloads within Apple's Machine Learning Platform Technology & Infra team. This position involves collaboration with machine learning researchers and other key partners to enhance system performance and resilience, ensuring the efficient operation of large-scale ML applications.

Requirements

  • Experience working with large scale parallel and distributed accelerator-based systems
  • Experience optimizing performance and AI workloads at scale
  • Experience developing code in one or more of training frameworks (such as PyTorch, TensorFlow or JAX)
  • Strong communicator with ability to analyze complex and ambiguous problems
  • Programming and software design skills (proficiency in C/C++ and/or Python)
  • Experience working in a high-level collaborative environment and promoting a teamwork mentality
  • Bachelor's degree in Computer Science and 7+ years of work experience

Nice To Haves

  • Deep understanding of computer systems and the interactions between HW and SW
  • Experience in performance analysis and optimization experience in Cloud accelerators
  • Advanced degree in CS

Responsibilities

  • Engage with ML researchers to optimize end-to-end performance of large scale distributed ML workloads
  • Analyze workload metrics to identify sources of inefficiencies and work with users to understand and optimize ML workloads
  • Conduct workload analysis based on benchmarking key workloads on deployed systems
  • Improve large scale training resiliency by optimizing applications and frameworks for improved recovery from failures and preemptions
  • Influence architecture, design, development, and operations of next generation ML accelerator systems based on workload insights

Benefits

  • Comprehensive medical and dental coverage
  • Retirement benefits
  • Discounted products and free services
  • Reimbursement for certain educational expenses including tuition
  • Discretionary bonuses or commission payments
  • Relocation assistance

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Computer and Electronic Product Manufacturing

Education Level

Bachelor's degree

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service