This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

Adobe Systems Incorporatedposted about 2 months ago
$153,600 - $286,600/Yr
Mid Level
Seattle, WA
Publishing Industries
Resume Match Score

About the position

We're looking for an outstanding, Site Reliability Engineer for Adobe's AI Inference Platform, Adobe Firefly. You will be part of a team of Site Reliability Engineers closely working with the Engineering teams on building, scaling, and securing the AI Platform. This enables the Firefly product teams to easily manage and deploy Machine Learning capabilities used by Adobe client applications. The Applied Research groups from Adobe Research and other App Teams in Adobe will deploy thousands of models onto this platform in a variety of lifecycle stages (early research, development, productization, optimization, etc). This platform will offer ML model serving at scale, with high-cost efficiency, and on a wide variety of hardware platforms across multiple clouds.

Responsibilities

  • Identify and implement methodologies and solutions to increase reliability, scalability, security, and efficiency.
  • Ensure the highest uptime and Quality of Service (QoS) for Adobe's customers through operational excellence.
  • Define service level objectives (SLOs) and indicators (SLIs) to represent and measure service quality.
  • Support and maintain globally distributed, multi-cloud (public and/or private) environments.
  • Automate common, repeatable tasks at a large scale to streamline operational procedures.
  • Identify areas to improve service resiliency through techniques such as chaos engineering, performance/load testing, etc.
  • Coordinate with other Adobe platform teams and service providers (primarily AWS) to innovate on Generative AI as a Service.

Requirements

  • A Bachelor's or Master's degree in Computer Science, Electrical Engineering, a related field, or equivalent industry experience.
  • You excel in undefined environments and get excited about finding pragmatic solutions to complex technical or organizational challenges.
  • You keep up with the industry trends and grow your knowledge and skills to solve technical problems.
  • Experience in building and scaling distributed systems, as well as experience with containerization and orchestration technologies like Kubernetes.
  • Production level expertise with containerization orchestration engines (e.g. Kubernetes) and proven understanding of modern, continuous development techniques and pipelines (IaC, CI/CD, ArgoCD, Git).
  • Fundamental programming skills, ideally practical experience in one (and preferably more) of the following languages: Python, Go.
  • Good knowledge of infrastructure configuration management tools like Ansible and Terraform.
  • Experience in using observability and tracing-related tools like InfluxDB, Prometheus, and Elastic Stack.
  • An understanding of AI/ML, including ML frameworks, public cloud, and commercial AI/ML solutions - familiarity with Pytorch, SageMaker, HuggingFace, NVIDIA TensorRT or OpenAI Triton a plus.

Benefits

  • Compensation reflects the cost of labor across several U.S. geographic markets.
  • Pay range for this position is $153,600 -- $286,600 annually.
  • Eligible for long-term incentives in the form of a new hire equity award.

Job Keywords

Hard Skills
  • Ansible
  • Elastic Stack
  • Go
  • InfluxDB
  • Prometheus
  • 1WFKBNv ZtIEUm
  • 6cdKYFzMrOUH0G FlHJiroKqm1
  • dNAxalsLZ1
  • dr75nGYy sRl2jp
  • fhUvHx7IjNMl DdBfNqYFS0i
  • fIM2Tb7O5F6x nIc2roMX6TyJ
  • GPc4ULaJqO0 PpwKAsxEzq14
  • JBvtzx IhqxkWgVDnz
  • k9HoZ6j 5RgKhI
  • OFv2CyUW iJ9Cb1sSL3Zc
  • PvaBebF8I6js CniGyNTKLEOD
  • q4RSGOFDzyC7 F8HjKQi0wWJN
  • qD4xRSIs
  • rKF7a YXlMfoxc
  • U3C4oGK
  • UwgI0Qzj2 3YbHwPXC
  • Y0EkUFsp SuavVg4Ae
  • y2XDPViT CiF3UKVhJ
Build your resume with AI

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service