Booz Allen Hamilton - Chantilly, VA
posted 2 months ago
The Site Reliability Administrator position offers an opportunity to contribute to the resilience and efficiency of systems that support national security. As a member of our team, you will build resilient infrastructure by implementing redundancy, using monitoring tools, and automating processes to improve system performance. You will reduce operational toil by scripting and automating self-repair tasks, making day-to-day operations more efficient. This role is ideal for individuals with a background in network engineering, systems administration, or software development who are passionate about improving systems and processes.

In this position, you will collaborate with a team dedicated to protecting national security while advancing your skills in cloud technologies and site reliability engineering (SRE). You will deploy, configure, and maintain Linux server systems, and design and manage services in Amazon Web Services (AWS). You will also troubleshoot and resolve issues across applications, operating systems, and infrastructure to keep systems running smoothly and efficiently.

This role requires the ability to support shift work, and candidates must hold a TS/SCI clearance with a polygraph. The position also emphasizes continuous learning and professional development, with opportunities for upskilling, mentoring, and networking within the organization. Join us in making a difference in national security while growing your career in a supportive and inclusive environment.