Sr. Engineer Site Reliability

$143,100 - $143,100/Yr

Albertsons - Pleasanton, CA

posted 3 months ago

Full-time - Mid Level
Remote - Pleasanton, CA
1,001-5,000 employees
Food and Beverage Retailers

About the position

The Technology & Engineering Department at Albertsons Companies is seeking a Senior Site Reliability Engineer to join the Retail Operations team located in Pleasanton, CA. This position is pivotal in ensuring the reliability and performance of our customer-facing applications. The ideal candidate will have a strong background in troubleshooting and incident resolution, particularly within a suite of applications that utilize modern technologies such as Java, Micro-services, Spring Boot, MS-Azure, Python, and React. The role involves configuring and implementing alert monitoring systems to proactively manage application performance and reliability. As a Senior Site Reliability Engineer, you will be responsible for diagnosing, isolating, and debugging production issues to swiftly resolve incidents that affect our customers. You will actively participate in post-incident root cause analysis and collaborate closely with development engineers to implement solutions for recurring problems. Your technical expertise will guide the diagnosis of issues as they arise, ensuring that critical applications operate smoothly. In addition to troubleshooting, you will contribute to the design and enhancement of critical services and applications, performing proactive analysis to predict and prevent production incidents. You will define and implement performance monitoring capabilities, design automated monitoring rules, and participate in the turnover of new platforms and technologies into the operational environment. Your role will also involve maintaining documentation for incidents and problems, preparing change documentation, and authoring knowledge articles in ServiceNow based on actionable monitoring alerts. This position requires a commitment to teamwork and collaboration, as you will manage multiple work streams and provide support for customer-facing activities that require 24/7 availability, including after-hours on-call rotation. The salary range for this position is between $109,700 and $143,100 annually, with starting salaries varying based on location, experience, and qualifications. Benefits include medical, dental, vision, disability and life insurance, sick pay, flexible time off, paid holidays, bereavement pay, and retirement benefits such as 401(k) eligibility.

Responsibilities

  • Diagnose, isolate, and debug production issues to quickly resolve customer-facing incidents.
  • Actively drive post-incident root cause analysis efforts.
  • Partner with development engineers to implement corrections to problems associated with supported applications.
  • Provide technical guidance in the diagnosis of issues as they arise in support of critical applications.
  • Drive collaboration sessions among IT and product groups to facilitate optimal performance, support, and operation of relevant services or applications.
  • Contribute to the design, implementation, and enhancement of critical services and applications.
  • Perform proactive analysis and troubleshooting to predict and prevent production incidents.
  • Define, contribute, and implement performance monitoring capabilities for critical services or applications.
  • Design, configure, and implement automated monitoring rules and alerts.
  • Participate in Production Turnover activities to bring new platforms and technologies into the environment.
  • Interface with Engineering Managers, Developers, and Build Experts to understand technology requests and business complexities.
  • Adhere to Incident, Problem and Change Management processes & best practices.
  • Maintain incident and problem ticket documentation.
  • Prepare change documentation & implement fixes for recurring issues.
  • Author and maintain knowledge articles in ServiceNow based on actionable monitoring alerts.
  • Provide knowledge transfer (KT) sessions among peers and offshore team members.
  • Collaborate with key vendors on functional, performance, and capacity improvements.
  • Foster teamwork and manage multiple work streams.
  • Provide support for customer-facing activities that require 24x7 availability, including after-hours On-Call rotation activities.

Requirements

  • 4-year degree in Computer Science, Information Systems, or related field, or equivalent combination of education or work experience.
  • 5 years of programming experience using various standard scripting languages and high-level programming languages.
  • Strong troubleshooting skills with the ability to quickly diagnose complex production issues.
  • Experience with application servers (WebSphere, WebLogic, and/or JBoss) and database technologies (Oracle, DB2, and/or SQL Server).
  • Experience in UI/Web 2.0 Development (JavaScript, CSS, Ajax, Adobe Flash/Flex, Dojo, YUI, and/or JQuery).
  • Strong knowledge of UNIX and Windows operating systems.
  • Experience creating and maintaining application processes and documentation.
  • Knowledge of current monitoring tools (Grafana, Azure Tools).
  • Exposure to network concepts and technologies.
  • Strong experience with the full software development lifecycle and software development methodologies (Agile).
  • Ability to understand client expectations and resolve issues that may affect service.
  • Strong interpersonal skills with the ability to work effectively across multiple levels of the organization.
  • Ability to mentor, coach, and train other application support engineers.
  • Self-starter with a demonstrated ability to learn beyond formal training and a strong aptitude for delivering quality products.

Nice-to-haves

  • Experience in a retail environment is preferred.

Benefits

  • Medical insurance
  • Dental insurance
  • Vision insurance
  • Disability insurance
  • Life insurance
  • Sick pay (accrued based on hours worked)
  • Flexible Time Off (PTO/Vacation Pay)
  • Paid holidays (8-9 days annually)
  • Bereavement pay
  • Retirement benefits (such as 401(k) eligibility)
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service