This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

Class Technologies • posted 26 days ago
$100,000 - $130,000/Yr
Full-time • Mid Level
Washington, DC

About the position

We are seeking a skilled DevOps Engineer to join our team. The ideal candidate will design, implement, and maintain cloud infrastructure on AWS and Azure, primarily using Terraform for infrastructure as code. You will manage and optimize Kubernetes clusters for production workloads, implement CI/CD pipelines for automated testing and deployment, and collaborate with development teams on containerization strategies. You will also monitor and optimize system performance, capacity, and availability, and implement robust logging and monitoring solutions.

For security and compliance, you will implement and maintain security controls that meet FedRAMP, NIST 800-53 Rev 5, and NIST 800-171 requirements, participate in security assessments, and conduct regular security reviews. For disaster recovery and business continuity, you will develop and test disaster recovery procedures, maintain business continuity plans, and ensure data replication across multiple regions.

As part of operations, you will participate in an on-call rotation, troubleshoot complex infrastructure issues, respond to security incidents, and maintain comprehensive documentation of systems and processes. You will also mentor and guide junior team members.

Responsibilities

  • Design, implement, and maintain cloud infrastructure on AWS and Azure using infrastructure as code (primarily Terraform)
  • Manage and optimize Kubernetes clusters for production workloads
  • Implement and maintain CI/CD pipelines for automated testing and deployment
  • Collaborate with development teams to implement containerization strategies
  • Monitor and optimize system performance, capacity, and availability
  • Implement and maintain robust logging and monitoring solutions
  • Implement and maintain security controls to meet FedRAMP, NIST 800-53 Rev 5, and NIST 800-171 requirements
  • Participate in security assessments and remediation efforts
  • Implement and maintain security baseline configurations
  • Conduct regular security reviews of infrastructure and applications
  • Document and maintain security procedures and policies
  • Develop, implement, and regularly test disaster recovery procedures
  • Maintain and update business continuity plans
  • Implement automated backup and recovery solutions
  • Conduct simulated disaster recovery exercises
  • Ensure data replication and redundancy across multiple regions
  • Develop and maintain runbooks for critical system recoveries
  • Participate in an on-call rotation (1 week every 4-6 weeks)
  • Troubleshoot and resolve complex infrastructure issues
  • Respond to and remediate security incidents
  • Maintain comprehensive documentation of systems and processes
  • Use Jira for task management, incident tracking, and workflow automation
  • Provide mentorship and guidance to junior team members

Requirements

  • 3+ years of experience in DevOps, Site Reliability Engineering, or similar roles
  • Expert-level knowledge of AWS services including S3, EC2, EKS, ALB, FSx, WorkSpaces, Directory Service, ECS, Fargate, RDS, and Lambda
  • Proficiency with Azure services (equivalent to the AWS services mentioned above)
  • Advanced knowledge of Terraform for infrastructure as code
  • Deep understanding of Kubernetes administration and architecture
  • Strong experience with Git version control and CI/CD pipelines
  • Experience with containerization technologies (Docker, Kubernetes)
  • Familiarity with FedRAMP, NIST 800-53 Rev 5, and NIST 800-171 requirements
  • Experience implementing and maintaining security controls for cloud environments
  • Experience implementing and testing disaster recovery procedures
  • Strong documentation skills and experience with Jira
  • Excellent verbal and written communication skills
  • Ability to work independently and as part of a team
  • Problem-solving skills and ability to work under pressure

Nice-to-haves

  • Experience with Adobe Connect and/or Adobe Learning Manager
  • Experience with eLearning platforms or learning management systems
  • Experience with PaaS and SaaS offerings
  • Experience with network security and firewall configuration
  • Experience with database administration (SQL and NoSQL)
  • Experience with scripting languages (Python, Bash, PowerShell)
  • Experience with configuration management tools (Ansible, Chef, Puppet)
  • Experience with log aggregation and analysis tools
  • Experience with monitoring tools (Prometheus, Grafana, CloudWatch)

Benefits

  • Medical, dental, vision, and more

Job Keywords

Hard Skills
  • Ansible
  • Bash
  • Jira
  • Kubernetes
  • Terraform