Site Reliability Engineer

$234,200 - $234,200/Yr

Adobe - San Jose, CA

posted 3 months ago

Full-time - Mid Level
San Jose, CA
5,001-10,000 employees
Publishing Industries

About the position

At Adobe, we are dedicated to changing the world through digital experiences. Our mission is to empower everyone, from emerging artists to global brands, with the tools they need to design and deliver exceptional digital experiences. We are passionate about enabling individuals to create beautiful and powerful images, videos, and applications, transforming how companies interact with their customers across every screen. We are committed to hiring the very best talent and creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We believe that new ideas can come from anywhere within the organization, and we recognize that the next big idea could be yours!

The DXUE team, part of Adobe Experience Cloud, is focused on building a developer ecosystem and creating next-generation products on the Adobe Experience Platform. Our team is responsible for producing a set of highly performant reusable services and shared UI components. We work on all aspects of software engineering, developing highly available services and building applications powered by these services.

In this individual contributor position based in our San Jose office, you will be responsible for developing software and tools and for providing hands-on technical expertise to design, deploy, and optimize Cloud services. You will build automation using industry-standard tools such as Jenkins, Terraform, and Spinnaker to deploy services. Your role will involve participating in release cycles, deploying code to staging and production environments, and integrating with continuous integration (CI) and continuous delivery (CD) tools while monitoring the services. You will also identify high-risk architecture issues and system bottlenecks, propose solutions for them, and work on improving the security and availability of our services. Additionally, you will participate in on-call duties, drive issues to resolution, and produce tools that enhance team productivity.

Responsibilities

  • Develop software/tools and provide hands-on technical expertise to design, deploy, and optimize Cloud services.
  • Build automation using industry-standard tools such as Jenkins, Terraform, and Spinnaker to deploy services.
  • Participate in release cycles of our services, deploying code to staging and production environments, integrating with CI and CD tools, and monitoring.
  • Identify single points of failure and other high-risk architecture issues; propose and implement more resilient resolutions.
  • Identify system bottlenecks and recommend solutions to address availability issues.
  • Participate in on-call rotations, drive any issues found to resolution, and deliver postmortems.
  • Proactively work on efficiency and capacity planning to set clear requirements and reduce system resource usage.
  • Evangelize SRE principles and guide the development team to build reliable services.
  • Produce tools that will increase the productivity of teams.

Requirements

  • At least 5 years of experience as an SRE in Cloud engineering.
  • Experience in automation and tool development.
  • At least 3 years of experience building Cloud services and distributed systems, including deployment, monitoring, scaling, and debugging.
  • Proficient in multi-cloud environments: AWS, Azure.
  • Experienced in writing applications using Java, Python, or JavaScript.
  • Experience scaling highly performant systems to their limits.
  • Track record of crafting resilient solutions to ensure reliability.
  • Ability to articulate tradeoffs between cost, performance, and quality when improving services.
  • Experience providing detailed reporting and analysis through metrics and logs.
  • Experience with container technologies: Kubernetes, Docker.
  • Experience with New Relic, Splunk, or Prometheus is a plus.
  • Excellent communication skills and a hunger to learn and succeed in a fast-paced environment.

Nice-to-haves

  • Experience with additional cloud platforms beyond AWS and Azure.
  • Familiarity with other programming languages such as Go or Ruby.
  • Knowledge of security best practices in cloud environments.

Benefits

  • Competitive salary and performance bonuses.
  • Health insurance coverage.
  • 401(k) retirement savings plan with company matching.
  • Paid time off and holidays.
  • Tuition reimbursement for further education.
  • Professional development opportunities.
  • Flexible work hours and remote work options.