CVS Health - Hartford, CT

posted 5 months ago

Full-time - Mid Level
Hartford, CT
Health and Personal Care Retailers

About the position

The Sr. Site Reliability Engineer - Incident Management role at CVS Health is a pivotal position within a nascent engineering team dedicated to enhancing incident response capabilities across the CVS DDAT organization. This role is centered around fostering a culture of incident management that not only addresses incidents effectively but also maximizes learning opportunities from each occurrence. The incident team is committed to empowering data engineers, ensuring they feel confident during on-call situations, and enhancing communication to resolve incidents efficiently. By partnering closely with various teams, the role aims to drive a holistic incident management strategy that promotes continuous improvement within the business. As a member of this engineering team, you will be responsible for driving incident management capabilities and cultivating a culture that prioritizes incident response. This includes contributing to incident command on-call duties, building technical skills, and fostering relationships within a diverse team of engineers and Site Reliability Engineers (SREs). The role emphasizes collaboration and cross-functional learning, allowing you to teach and learn from others while iterating towards effective solutions in an agile environment. The ideal candidate will have a strong background in incident management within cloud environments and a passion for working across various scopes, including software engineering, cloud platforms, and SRE practices. You will be expected to drive efficiency improvements in software at scale and collaborate effectively to instill an engineering culture that values empathy and compassion, particularly in customer-centric and agile organizations. Experience with SaaS or managed software offerings, as well as expertise in major public clouds, will be crucial for success in this role.

Responsibilities

  • Drive incident management capabilities and culture within the team.
  • Contribute to incident command on-call duties.
  • Build technical skills and relationships within a team of engineers and SREs.
  • Learn, teach, and collaborate cross-functionally to enhance incident response.
  • Work on a variety of scopes spanning software engineering, cloud platform, and SRE.

Requirements

  • 3+ years' professional experience with incident management in cloud environments.
  • 3+ years' experience working on infrastructure teams in customer-centric and agile organizations.
  • 3+ years' experience working with SaaS or another type of managed software offering.
  • 3+ years' experience in one or more of the major public clouds (GCP or AWS strongly preferred).
  • Bachelor's degree in IT or relevant field; or equivalent experience.

Nice-to-haves

  • Master's degree in computer science, Engineering, or a related technical field.
  • Prior experience working with large scale web-based Java architectures and JVM configuration.
  • Professional certifications in cloud platforms, monitoring tools, or related technologies.
  • Previous experience working on a large-scale ecommerce platform GCP.

Benefits

  • Full range of medical, dental, and vision benefits.
  • 401(k) retirement savings plan.
  • Employee Stock Purchase Plan for eligible employees.
  • Fully-paid term life insurance plan for eligible employees.
  • Short-term and long-term disability benefits.
  • Numerous well-being programs.
  • Education assistance and free development courses.
  • CVS store discount and discount programs with participating partners.
  • Paid Time Off (PTO) and paid holidays throughout the calendar year.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service