This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

DevOps Engineer

Class Technologiesposted 26 days ago

$100,000 - $130,000/Yr

Full-time • Mid Level

Washington, DC

About the position

We are seeking a skilled DevOps Engineer to join our team. The ideal candidate will be responsible for designing, implementing, and maintaining cloud infrastructure on AWS and Azure, primarily using Terraform for infrastructure as code. You will manage and optimize Kubernetes clusters for production workloads, implement CI/CD pipelines for automated testing and deployment, and collaborate with development teams to implement containerization strategies. Additionally, you will monitor and optimize system performance, capacity, and availability, and implement robust logging and monitoring solutions. In the realm of security and compliance, you will implement and maintain security controls to meet FedRAMP, NIST 800-53 Rev 5, and NIST 800-171 requirements, participate in security assessments, and conduct regular security reviews. You will also be responsible for disaster recovery and business continuity, developing and testing disaster recovery procedures, maintaining business continuity plans, and ensuring data replication across multiple regions. As part of operations, you will participate in an on-call rotation, troubleshoot complex infrastructure issues, respond to security incidents, and maintain comprehensive documentation of systems and processes. You will also provide mentorship and guidance to junior team members.

Responsibilities

Design, implement, and maintain cloud infrastructure on AWS and Azure using infrastructure as code (primarily Terraform)
Manage and optimize Kubernetes clusters for production workloads
Implement and maintain CI/CD pipelines for automated testing and deployment
Collaborate with development teams to implement containerization strategies
Monitor and optimize system performance, capacity, and availability
Implement and maintain robust logging and monitoring solutions
Implement and maintain security controls to meet FedRAMP, NIST 800-53 Rev 5, and NIST 800-171 requirements
Participate in security assessments and remediation efforts
Implement and maintain security baseline configurations
Conduct regular security reviews of infrastructure and applications
Document and maintain security procedures and policies
Develop, implement, and regularly test disaster recovery procedures
Maintain and update business continuity plans
Implement automated backup and recovery solutions
Conduct simulated disaster recovery exercises
Ensure data replication and redundancy across multiple regions
Develop and maintain runbooks for critical system recoveries
Participate in an on-call rotation (1 week every 4-6 weeks)
Troubleshoot and resolve complex infrastructure issues
Respond to and remediate security incidents
Maintain comprehensive documentation of systems and processes
Use Jira for task management, incident tracking, and workflow automation
Provide mentorship and guidance to junior team members

Requirements

3+ years of experience in DevOps, Site Reliability Engineering, or similar roles
Expert-level knowledge of AWS services including S3, EC2, EKS, ALB, FSX, WorkSpaces, Directory Services, ECS, Fargate, RDS, and Lambda
Proficient with Azure services (equivalent to AWS services mentioned above)
Advanced knowledge of Terraform for infrastructure as code
Deep understanding of Kubernetes administration and architecture
Strong experience with Git version control and CI/CD pipelines
Experienced with containerization technologies (Docker, Kubernetes)
Familiarity with FedRAMP, NIST 800-53 Rev 5, and NIST 800-171 requirements
Experience implementing and maintaining security controls for cloud environments
Experience with implementing and testing disaster recovery procedures
Strong documentation skills and experience with Jira
Excellent verbal and written communication skills
Ability to work independently and as part of a team
Problem-solving skills and ability to work under pressure