Engineering Manager, SRE

$131,300 - $257,700/Yr

Services en nuage Genesys - Durham, NC

posted about 2 months ago

Full-time - Mid Level
Remote - Durham, NC
5,001-10,000 employees

About the position

The Service Reliability discipline at Genesys focuses on ensuring the reliability, observability, and recoverability of services operating at scale. This hands-on role involves developing and implementing tools and techniques to support the Genesys Cloud's growth, directly influencing system success through technical expertise.

Responsibilities

  • Develop Observability Strategy: Craft and maintain a comprehensive observability strategy that aligns with organizational goals, ensuring system reliability and performance.
  • Establish Best Practices: Define and promote best practices for monitoring, alerting, and observability across the engineering organization.
  • Hands-On Development: Actively participate in the development and integration of observability solutions, including OpenTelemetry implementations, Terraform integrations, and microservices enhancements.
  • Budget Management: Contribute to managing the observability budget, focusing on AWS services and third-party vendor costs.
  • Consultation and Support: Assist internal teams with the instrumentation, monitoring, and alerting of microservices on AWS, ensuring optimal system performance.
  • Vendor Relationship Management: Manage and optimize relationships with observability vendors to ensure seamless integration and service utilization.
  • Security and Compliance: Ensure that all observability systems and integrations comply with security and data privacy requirements.
  • Incident Response: Participate in on-call rotations, handle incidents, and contribute to post-incident analyses to continuously improve system reliability.
  • Mentorship and Development: Mentor team members, fostering their professional growth and contributing to the hiring and onboarding of new team members.

Requirements

  • Strong understanding of observability patterns (metrics, traces, logs)
  • Proven experience in implementing monitoring and alerting solutions for cloud-native microservices
  • Familiarity with OpenTelemetry and related technologies
  • Previous experience leading software engineering teams
  • Practical experience with AWS services
  • Proficiency in root cause analysis and incident response methodologies
  • Hands-on experience in developing and integrating technical solutions
  • Experience with on-call support for production systems
  • Bachelor's degree in Computer Science or a related field

Benefits

  • Medical, Dental, and Vision Insurance
  • Telehealth coverage
  • Flexible work schedules and work from home opportunities
  • Development and career growth opportunities
  • Open Time Off in addition to 10 paid holidays
  • 401(k) matching program
  • Adoption Assistance
  • Fertility treatments
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service