Hilton - McLean, VA

posted 3 months ago

Full-time - Senior
Remote - McLean, VA
501-1,000 employees
Accommodation

About the position

As a Senior Lead Site Reliability Engineer at Hilton, you will play a crucial role in building and supporting Continuous Integration/Continuous Deployment (CI/CD) pipelines and managing production releases. Your primary responsibility will be to solve complex performance and scaling issues, collaborating closely with engineers to prevent bottlenecks and ensure that we can meet traffic demands driven by both organic growth and marketing events. You will be tasked with developing and maintaining release architectures and monitoring frameworks that support the product team, enhancing process flows for efficient delivery. Additionally, you will provide system design consulting and critical support to the development team prior to program launches, ensuring that all systems are robust and reliable. In this position, you will also supervise direct reports, guiding them in their professional development and ensuring that the team meets its objectives. The role requires a proactive approach to problem-solving and a deep understanding of the technologies involved in site reliability engineering. You will be expected to travel domestically up to 25% of the time, which may involve visiting various Hilton locations to support operational needs and collaborate with teams across the organization.

Responsibilities

  • Build and support CI/CD pipelines and production releases.
  • Solve sophisticated performance and scaling issues in collaboration with engineers.
  • Develop and maintain release architectures and monitoring frameworks.
  • Provide system design consulting and critical support to the development team prior to program launch.
  • Supervise direct reports and guide their professional development.

Requirements

  • Bachelor's degree in Industrial Management, Engineering, Systems Engineering, or a closely related field.
  • Three (3) years of experience in reliability engineering or a related field.
  • Experience with Kubernetes/EKS and pod life cycle management including readiness and liveness checks.
  • Proficiency in using Docker and BASH shell scripting.
  • Experience with Dynatrace APM and RUM, Datadog, and logging analysis with Splunk.
  • Experience in pipeline creation, troubleshooting, and configuration of Gitlab CI.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service