Inovex Information Systems - Herndon, VA

posted 3 months ago

Full-time
Herndon, VA
Professional, Scientific, and Technical Services

About the position

HTS (iNovex) is committed to prioritizing people and fostering a strong work/life balance for its employees. We invest in our team members by providing exceptional benefits, flexible schedules, and the necessary tools for success, including paid training and mentoring. Our mission is critical to ensuring the security of our nation through System Engineering, Network Engineering, Systems Integration, and Software Engineering & Development. We are seeking experienced Systems Engineers/Site Reliability Engineers (SRE) to support a key Government customer in a technology-based program. As a Systems Engineer/SRE, you will play a vital role in guiding IT developers throughout the software development life cycle. Your responsibilities will include overseeing the development, testing, and implementation of technical solutions, ensuring they meet defined requirements. You may also provide Agile DevOps support for mission-critical systems, contributing to the creation of robust systems, software, and cloud environments while maintaining operations and maintenance for critical systems. This position will require you to provide technical expertise in the design, development, implementation, and testing of customer tools and applications, all within a DevOps framework. You will participate in and/or direct major project deliverables through all aspects of the software development lifecycle, including scope and work estimation, architecture and design, coding, and unit testing.

Responsibilities

  • Ensuring reliability and getting systems back to steady state as quickly as possible
  • Eliminating toil and automating wherever possible
  • Driving better cross-team collaboration
  • Gaining full visibility into IT systems and services for system health
  • Identifying system deficiencies and recommending solutions
  • Developing Service Level Indicators (SLI) for IT systems and services
  • Developing Service Level Objectives (SLO) for IT systems and services
  • Developing Service Level Agreements (SLA) for IT systems and services
  • Maintenance and continuous improvement of processes, standards, policies, working methods, and tools
  • Ensuring appropriate tools and processes are in place for a reliable and reproducible development/production environment
  • Ensuring tool configuration consistency across Development, Testing, Integration, and Production environments
  • Participating in ongoing production support and end user support
  • Researching, understanding, and developing using new technologies and standards as needed
  • Evaluating interface between hardware and software, operational requirements, and characteristics of the overall system

Requirements

  • A minimum of sixteen (16) years relevant experience with Bachelor's or Master's degrees
  • Knowledgeable in Incident Management and organizing Incident Response Teams
  • Good understanding of incident response role structure and concepts for automating incident resolution
  • Knowledgeable with SLO to help Operations define and improve SLO for IT systems
  • CI/CD implementation expertise
  • Scripting skills in languages such as MS PowerShell, Python, JavaScript, Ruby, PHP etc.
  • Ability to efficiently estimate work effort requirements
  • Effective communication skills, both written and verbal
  • Ability to handle multiple tasks and meet deadlines
  • Ability to work independently and in a team environment
  • Adaptability to a constantly changing environment
  • Willingness to work with newer emerging technologies/tools
  • Ability to deliver enhanced functionality and provide continuous support while preserving system integrity
  • High degree of initiative, creativity, and technical ability
  • Ability to identify issues and implement corrective actions

Nice-to-haves

  • IT project management experience
  • Familiarity with Scrum, Lean, Agile, and DevOps
  • Experience with Java, Ruby, DevOps, and DevSecOps
  • Knowledge of IT Operations Management (ITOM) software
  • Understanding of Quality Assurance and Test Automation for software pre-deployment
  • Good understanding of DevOps concepts and best practices
  • Issue troubleshooting experience
  • Understanding of Networking concepts
  • Linux/Unix concepts
  • ServiceNow knowledge in developing products using JavaScript and other coding applications
  • Database Administration (Oracle or MYSQL) experience

Benefits

  • Exceptional benefits
  • Flexible schedules
  • Paid training
  • Mentoring opportunities
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service