JBS Internationalposted about 1 month ago
Full-time • Senior
North Bethesda, MD
Administrative and Support Services

About the position

The Site Reliability Engineer III engages in diverse client-facing projects, focusing on the automation, deployment, monitoring, and upkeep of software solutions aligned with client specifications. This role involves close collaboration with fellow engineers, project managers, and stakeholders to enhance the quality and efficiency of software development and delivery processes. Additionally, The Site Reliability Engineer III is the Site Reliability Engineering (SRE) team lead and staff manager responsible for the career development, training, assessment, and hiring of SRE team members.

Responsibilities

  • Automates the software development and delivery processes using various CI/CD tools and technologies, such as BitBucket, BitBucket Pipelines, Jenkins, Docker, Ansible, Terraform, CloudFormation, Shell and Python scripts, etc.
  • Manage AWS cloud operations and administration of AWS CloudFront, WAF, ELB, EC2, RDS, S3, Systems Manager, and Grafana via the AWS Management Console, CloudFormation, and/or Ansible and AWX.
  • Monitors and troubleshoots the performance, availability, and security of the software solutions using various tools and techniques, such as Grafana, ELK, OpenTelemetry, AWS CloudWatch, CloudTrail, etc.
  • Collaborates with the development team to assist in the management of the software delivery lifecycle using Git operations from a CLI or the Bitbucket Web UI.
  • Collaborates and coordinates with cross-functional teams and stakeholders to ensure alignment and seamless integration of software solutions within the broader context of the SDLC and software delivery.
  • Continuous integration (CI) merges code changes to ensure the most recent version is available to developers.
  • Continuous delivery and continuous deployment (CD) - automate the process of releasing updates to increase efficiency.
  • Prompt and thorough installation of patches and updates to the full stack solution.
  • Troubleshooting and responding to issues that are escalated to SRE or SecOps.
  • Automating and running security scans- both OS and applications at least monthly, or per contract specifications.
  • Common weaknesses enumeration (CWE) mitigation - implementing corrective actions detected during security scans.
  • Threat modeling - implements security testing during the development pipeline to save time and cost in future.
  • Automated security testing - test for vulnerabilities in new builds on regular basis.
  • May be dedicated to one project or may split time across multiple projects providing broad-based support for the Digital Center Director.
  • Follows JBS policies, procedures, and best practices.
  • Designs and implements mitigations to federal security vulnerabilities discovered during continuous monitoring of our managed sites and applications.
  • Works collaboratively in several cross-functional project teams, such as Development and QA.
  • Practices strong and frequent communications with team members, project stakeholders, and client staff.
  • Delivers work on time and on budget.
  • Follows DevOps and DevSecOps industry advances and best practices.
  • Executes management, supervision, appraisal, and recruiting of the SRE team.
  • Supports Business Development opportunities and proposal development as needed.
  • Infrastructure as Code (IaC) - Experience using industry standard IaC software such as CloudFormation, Terraform, or Ansible.
  • Strong documentation skills in industry standard tools such as Confluence.
  • Contributes to business development (BD) and proposal development, as needed.

Requirements

  • Bachelor's degree in computer science with minimum of 8 years working in Site Reliability Engineering (SRE), DevOps, Software Development, or Infrastructure Operations, with at least 6 years of experience in Cloud DevOps.
  • In lieu of a bachelor's degree must have minimum of 10 years of related IT work experience.
  • Experience in designing and implementing software development and delivery processes using various tools and technologies, such as Jenkins, Docker, Ansible, etc.
  • Experience in working with various environments, platforms, and services, such as cloud, on-premise, or hybrid, AWS, Azure, Google Cloud, etc.
  • Experience in working with agile methodologies and tools, such as Scrum, Kanban, Jira, etc.
  • Experience in leading and managing teams responsible for software development, SRE, DevOps or, IT infrastructure management.
  • Certification in DevOps from a Cloud Service Provider, such as AWS or Azure.

Nice-to-haves

  • AWS Certification preferred.
  • Familiarity with CMMI Dev Level 3 methodology and appraisals helpful.
  • Familiarity with the FedRAMP and FISMA security frameworks, as well as NIST 800.53 Rev 5 and the NIST Risk Management Framework (RMF) needed to achieve and maintain Authority To Operate (ATO) with our federal clients helpful.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service