Peraton - Herndon, VA
posted 4 months ago
The Senior Technical Site Reliability Engineering (SRE) Cloud Manager will play a pivotal role in leading a high-performing team responsible for providing 24/7 engineering and delivery support for multiple AWS hosting environments. This position requires a hands-on approach to managing a Site Reliability Engineers (SRE) / DevSecOps team, ensuring that all tasks are executed efficiently and effectively. The individual will be responsible for daily customer interactions, managing team tasks, and ensuring accountability for deliverables, metrics, and dashboards, ultimately driving the success of customer engagements. In this role, the manager will develop and manage Infrastructure as a Service (IaaS) and Platform as a Service (PaaS) solutions through cross-technology administration for Amazon Web Services (AWS). The position involves leading efforts to utilize AWS platform services such as Amazon EC2, Lambda, CodeStar, Amazon Elastic Container Services, Fargate, and Amazon EKS to design and build modern applications that are secure, reliable, scalable, and quickly available for customers. The manager will also lead technical discussions with various stakeholders, including clients, application teams, suppliers, and program managers, to ensure successful delivery and ongoing maintenance of application releases and upgrades. The Senior Technical SRE Cloud Manager will be responsible for shaping, executing, and delivering solutions for a broad range of customer use cases, particularly for large enterprise customers. This includes providing expert thought leadership in enhancing infrastructure, application, and cloud management methodologies while implementing industry best practices. The role also involves managing cloud-based disaster recovery, high availability, and other technologies to ensure business continuity, as well as overseeing public cloud consumption cost management and providing cost-saving recommendations. Additionally, the manager will conduct project and issue management, including staffing, financials, quality, and risk reporting, while making decisions that impact the team. They will lead client infrastructure and cloud migration engagements, present at an executive level on cloud optimization, and analyze application portfolios to assess transformation feasibility. The position requires a strong focus on compliance and proactive management to prevent escalations within the program.