Alibaba Cloudposted 2 months ago
$133,200 - $219,600/Yr
Full-time • Mid Level
Seattle, WA

About the position

Elastic Compute Service (ECS) is a core product of Alibaba Cloud. The Elastic Compute team is dedicated to building world-leading cloud computing infrastructure. As a key component of Alibaba Cloud's self-developed Apsara operating system, Elastic Compute Service (ECS) provides full-stack computing resources covering virtual machine instances, container services, and Heterogeneous computing clusters. Through technological innovation and product optimization, the Alibaba Cloud Elastic Compute team continuously drives advancements in cloud computing technologies, delivering high-quality computing services to users worldwide. Our goal is not only to support enterprises in achieving elastic scalability but also to deeply empower infrastructure innovation in the New era. Our mission is to build an intelligent foundation of 'Computing as a Service,' enabling developers to focus on businesses to concentrate on breakthroughs, without worrying about the complex engineering implementations from chips to clusters. The Alibaba Cloud Elastic Compute Service (ECS) SRE (Site Reliability Engineering) team is a critical force in ensuring system stability and reliability. The SRE team focuses on guaranteeing the high availability, high performance, and robust stability of ECS products through technical expertise and innovation. The Alibaba Cloud ECS SRE team is not only a core technical safeguard but also a driver of technological innovation and continuous optimization. By leveraging technical capabilities and collaborative teamwork, we ensure the stability and reliability of ECS products, safeguarding global customers' businesses. Additionally, we are committed to advancing cloud computing technologies through knowledge sharing and industry collaboration. Joining the Alibaba Cloud ECS SRE team offers the opportunity to engage in the development and optimization of world-leading cloud computing technologies, while growing alongside a passionate and creative team.

Responsibilities

  • Oversee the stability, performance optimization, monitoring, and operational work for multiple core products of Alibaba Cloud (such as ECS, ACK, ACS, Heterogeneous computer cluster, OOS, Compute Nest, etc.), taking responsibility for the online stability of these products.
  • Engage in the development of operation systems and some online systems. Through tools, process optimization, and system improvements, ensure the stability and performance of Alibaba Cloud's Elastic Computing-related products.
  • Work closely with other teams (such as R&D, after-sales support, etc.) to ensure efficient technical support and problem resolution.

Requirements

  • Bachelor's degree or higher in Computer Science, Information Technology, or a related field.
  • At least 3 years of experience in system operations or SRE, with familiarity in cloud computing services and core products (e.g., ECS, K8S, Heterogeneous Computer, etc.).
  • Familiarity with the design and optimization of cloud resource provisioning and delivery systems; experience in serving overseas customers is preferred.
  • In-depth understanding of the overall architecture and operational mechanisms of the elastic computing product line, with the ability to quickly identify and resolve complex issues.

Nice-to-haves

  • Possession of cloud-related certifications (e.g., ACP, ACE, or other major cloud vendor certifications).
  • Participation in the architectural design or performance optimization projects of large cloud platforms.
  • Outstanding contributions in system stability assurance, automation tool development, or cloud-native domains are highly valued.

Benefits

  • Medical, dental, and vision insurance
  • 401(k) plan
  • Basic life insurance
  • Wellbeing benefits like FSA
  • Up to 12 paid holidays
  • Accrue up to 15 paid vacation days
  • Receive up to 72 hours paid sick time (front-loaded) per calendar year

Job Keywords

Hard Skills
  • Alibaba Cloud
  • Cloud Computing
  • Elasticity Computing
  • Information Sciences
  • Reliability Engineering
  • 0GNfI1V9tv8 mlchxwCeLo
  • 2wRpM JKA7dEva
  • 4bxJkRnKhF BJexfoh I8VWZhGYDumg
  • 6CwMYloc njxdSGI9X748t
  • 7i8nIJXuZS4 U74przNWvK
  • bSFusXl4 xAeKrJk76FX
  • E2C16jtimae qH7Xme1VnB
  • eJisN4vk 6Z9iqPF
  • EumSxLP73G 478EXpLB
  • fyagWVpXJ7w T57FMExcgk
  • hIVkCBpjMoa K39ctIfOnS
  • Jk21ew RBq7uiDkGt
  • jxKgvC Z5pELe TMo2hIfO
  • KuCrD0SIAV 36xY0e1 t9hzG7ypPKBo
  • LeV7NvGT0tI SD36O
  • LHg 2vRCzySKdcXrP jlM2OAu
  • NHl9m1ykZFS P3AVkFHeJpMN5
  • rApVQv VGOM1uASPQ
  • txOC6jL1K 2PN5Cd
  • U7ZuTxNf hM5iVvagC
  • ZHKLA yZvjbhED
  • ZTQIs A4aUsvicjBNn
Build your resume with AI

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service