Senior Site Reliability Engineer

$111,260 - $183,580/Yr

Red Hat - Lowell, MA

posted 11 days ago

Full-time - Senior
Remote - Lowell, MA
Professional, Scientific, and Technical Services

About the position

Red Hat is seeking a Senior Site Reliability Engineer (SRE) to develop, scale, and operate OpenShift managed cloud services. The role involves contributing to the scalability and reliability of services, enabling customer self-service, and automating processes. The SRE will work within a hybrid model from specified locations and will have the opportunity to tackle complex challenges unique to Red Hat managed cloud services.

Responsibilities

  • Contribute code to increase the scalability and reliability of the service
  • Contribute software tests and participate in peer review to increase the quality of our codebase
  • Help develop peers' capabilities through knowledge sharing, mentoring, and collaboration
  • Participate in a regular on-call schedule, including occasional paid weekends and holidays
  • Practice sustainable incident response and blameless postmortems
  • Resolve customer issues escalated from the Red Hat Global Support team
  • Work within a small agile team to develop and improve SRE software, support your peers, plan, and self-improve

Requirements

  • 3+ years of software engineering experience using an object-oriented language such as Python or Golang (Golang preferred)
  • 3+ years of experience managing Linux-based systems in a public cloud such as AWS, GCP, or Azure
  • 3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is preferred
  • 2+ years of experience coaching and mentoring team members in technology and customer support
  • 1+ year of experience delivering hosted cloud services
  • 1+ year of experience with Kubernetes
  • 1+ year of experience with containers on Linux
  • Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP
  • Excellent communication skills in a global team environment
  • Demonstrated ability to quickly and accurately troubleshoot systems issues

Nice-to-haves

  • Experience engineering solutions that meet the requirements of security and compliance frameworks such as ISO 27001, PCI-DSS, and FedRAMP is highly desirable
  • Direct experience with Kubernetes or OpenShift is a plus

Benefits

  • Comprehensive medical, dental, and vision coverage
  • Flexible Spending Account - healthcare and dependent care
  • Health Savings Account - high deductible medical plan
  • Retirement 401(k) with employer match
  • Paid time off and holidays
  • Paid parental leave plans for all new parents
  • Leave benefits including disability, paid family medical leave, and paid military leave
  • Employee stock purchase plan
  • Family planning reimbursement
  • Tuition reimbursement
  • Transportation expense account
  • Employee assistance program