Peraton - Herndon, VA

posted 4 months ago

Full-time - Senior
Herndon, VA
Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

About the position

The Senior Technical Site Reliability Engineering (SRE) Cloud Manager leads a high-performing team that provides 24/7 engineering and delivery support for multiple AWS hosting environments. The position requires a hands-on approach to managing a Site Reliability Engineering (SRE) / DevSecOps team and ensuring that all tasks are executed efficiently and effectively. The manager is responsible for daily customer interactions, team task management, and accountability for deliverables, metrics, and dashboards, ultimately driving the success of customer engagements.
In this role, the manager will develop and manage Infrastructure as a Service (IaaS) and Platform as a Service (PaaS) solutions through cross-technology administration for Amazon Web Services (AWS). The position involves leading efforts to use AWS platform services such as Amazon EC2, Lambda, CodeStar, Amazon Elastic Container Service (ECS), Fargate, and Amazon EKS to design and build modern applications that are secure, reliable, scalable, and quickly available to customers. The manager will also lead technical discussions with stakeholders, including clients, application teams, suppliers, and program managers, to ensure successful delivery and ongoing maintenance of application releases and upgrades.
The Senior Technical SRE Cloud Manager will shape, execute, and deliver solutions for a broad range of customer use cases, particularly for large enterprise customers. This includes providing expert thought leadership on infrastructure, application, and cloud management methodologies while implementing industry best practices. The role also involves managing cloud-based disaster recovery, high availability, and related technologies to ensure business continuity, as well as overseeing public cloud consumption cost management and providing cost-saving recommendations.
Additionally, the manager will conduct project and issue management, including staffing, financials, quality, and risk reporting, and make decisions that impact the team. The manager will lead client infrastructure and cloud migration engagements, present cloud optimization strategies at an executive level, and analyze application portfolios to assess transformation feasibility. The position requires a strong focus on compliance and proactive management to prevent escalations within the program.

Responsibilities

  • Technically lead and provide hands-on support for a Site Reliability Engineering (SRE) / DevSecOps team.
  • Manage daily customer interactions and team tasks with accountability for deliverables and metrics.
  • Develop and manage IaaS and PaaS solutions through cross-technology administration for AWS.
  • Utilize AWS platform services to design and build secure, reliable, and scalable applications.
  • Lead technical discussions with stakeholders including clients and application teams.
  • Collaborate with other leaders to ensure successful delivery and maintenance of application releases.
  • Shape and deliver solutions for a broad range of customer use cases.
  • Provide expert thought leadership in infrastructure and cloud management methodologies.
  • Manage cloud-based disaster recovery and high availability technologies.
  • Oversee public cloud consumption cost management and provide cost-saving recommendations.
  • Conduct project and issue management for assigned scope of work.
  • Lead client infrastructure and cloud migration engagements.
  • Present cloud optimization strategies at an executive level.
  • Analyze application portfolios and assess transformation feasibility.
  • Lead design and implementation of new IT infrastructure and cloud environments.

Requirements

  • 12 years of experience in technical and program management roles.
  • Leadership and technical hands-on skills in delivering cloud-based solutions.
  • Experience managing AWS core services such as VPC, Route 53, S3, EC2, IAM, RDS, and EBS.
  • Proficiency in AWS Management Console and configuring AWS services.
  • Experience with infrastructure automation using Terraform and CloudFormation.
  • Knowledge of infrastructure monitoring tools like Splunk and CloudWatch.
  • Experience in AWS cost optimization and managing Reserved Instances.
  • Ability to document and explain cloud infrastructure and application solution architectures.
  • Experience in providing secured solutions in cloud environments.
  • Strong problem-solving skills and ability to manage ambiguity.
  • Effective oral and written communication skills.

Nice-to-haves

  • AWS Certified Cloud Practitioner / AWS Certified Solutions Architect certification.
  • Bachelor's Degree or a Master's Degree.
  • Active 6C Public Trust clearance.

Benefits

  • Comprehensive medical plans.
  • Tuition reimbursement and assistance.
  • Fertility treatment support.