Microsoft - Redmond, WA

posted 3 months ago

Full-time - Senior
Remote - Redmond, WA
Publishing Industries

About the position

As a Senior Machine Learning Research Engineer, you will be at the forefront of innovating the latest hardware design to propel cloud growth. This unique career opportunity combines technical capabilities, cross-team collaboration, and business insight and strategy. You will join the Strategic Planning and Architecture (SPARC) team within the Azure Hardware System & Infrastructure (AHSI) organization, which is responsible for expanding cloud infrastructure and powering the "Intelligent Cloud" mission. This mission delivers more than 200 online services to over one billion individuals worldwide, with AHSI being the backbone of this expanding cloud infrastructure. You will be involved in delivering the core infrastructure and foundational technologies for cloud businesses, including Azure, Bing, MSN, Office 365, OneDrive, Skype, Teams, and Xbox Live. The SPARC organization manages Azure's hardware roadmap from architecture concept through production for all current and future online services. In this role, you will be a highly motivated Senior Machine Learning Research Engineer with a solid background in neural networks and hardware implementation. Your responsibilities will include model development, data type analysis, and ML/HW co-design. The mission of the organization is to empower every person and every organization on the planet to achieve more. As an employee, you will come together with a growth mindset, innovate to empower others, and collaborate to realize shared goals. Each day, you will build on the values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. The organization is committed to cultivating an inclusive work environment for all employees to positively impact the culture every day.

Responsibilities

  • Driving model/HW co-design.
  • Developing and analyzing novel NN architecture.
  • Inventing novel low-precision data formats.
  • Inventing novel model architecture.
  • Collaborating with data scientists and ML researchers.
  • Interfacing with HW architecture team.
  • Interfacing with SW framework team.
  • Embodying our culture and values.

Requirements

  • Doctorate or Master's in a relevant field or equivalent experience in ML system/model optimization/efficient model architecture.
  • Ability to meet customer and/or government security screening requirements, including a Cloud Background Check upon hire/transfer and every two years thereafter.
  • 3+ years of experience in ML system/model optimization/efficient model architecture.
  • Track record of original research and delivering novel results in ML system area.
  • Hands-on experience with frameworks such as PyTorch/TensorFlow/TensorRT.
  • Deep knowledge of CNN/transformer architecture and optimization strategies - quantization, parity, NAS, hardening, KV Cache, Flash Attention.
  • Solid programming skills in Python/C/C++.
  • Experience in implementing low-level linear algebra/BLAS kernel and performance optimization.
  • Outstanding communication skills.

Nice-to-haves

  • Master's Degree/PhD in Machine Learning, Computer Architecture/System, High-Performance Computing or related area.

Benefits

  • Industry leading healthcare
  • Educational resources
  • Discount on products and services
  • Savings and investment
  • Maternity and paternity leave
  • Generous time away
  • Giving program
  • Opportunities to network and connect
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service