Leidos - Bethesda, MD

posted 6 months ago

Full-time - Mid Level
Remote - Bethesda, MD
Professional, Scientific, and Technical Services

About the position

As a System Administrator specializing in Machine Learning Operations (MLOps), you will play a crucial role in supporting the infrastructure, deployment, and maintenance of a machine translation triage application that leverages advanced Artificial Intelligence (AI), Machine Learning (ML), and Data Science (DS) capabilities. This position is integral to a major program that aims to enhance the state of the art in MLOps, focusing on mission-driven big data analytics and predictive analytics. Your contributions will be vital in delivering cutting-edge machine learning capabilities that support national security objectives, enabling swift production and analysis of results, and disseminating findings that provide actionable intelligence insights. In this role, you will collaborate closely with a cross-functional agile application development team, which includes data scientists, data engineers, software developers, system engineers, researchers, and data analysts. Together, you will design, build, and optimize a complex, resource-intensive application. You will be responsible for managing multiple simultaneous work packages, taking high-level guidance, and independently providing first-class infrastructure and deployment solutions. Your intellectual curiosity, quantitative skills, and customer-focused approach will be essential in identifying innovative methods to enhance system performance through effective system administration optimizations, all while working alongside a highly qualified and motivated team. The position is primarily on-site at our client location in Bethesda, MD, but offers a flexible schedule, with some tasks potentially performed remotely depending on client requirements and deliverable priorities. This flexibility allows you to adapt to the dynamic needs of the project while maintaining a focus on delivering high-quality results.

Responsibilities

  • Support the infrastructure, deployment, and maintenance of a machine translation triage application.
  • Collaborate with a cross-functional agile application development team to design, build, and optimize applications.
  • Manage multiple simultaneous work packages and provide infrastructure and deployment solutions.
  • Identify novel approaches to improve system performance through administration optimizations.
  • Analyze and assess infrastructure and application deployment requirements to determine cost-effective solutions.

Requirements

  • Bachelor's degree or equivalent with a minimum of eight years of experience in a related field.
  • 4+ years of experience in system administration, preferably in a cloud-native environment.
  • Active TS/SCI clearance and ability to maintain it with Polygraph.
  • Strong experience with Amazon Web Services (AWS/C2S).
  • Strong experience with Linux OS system administration.
  • Experience in application and infrastructure deployment, configuration, and maintenance.
  • Ability to gain and maintain Privileged User Access (PUA) on the customer's network.
  • Experience maintaining hardware in compliance with security policy.
  • Proficiency in one or more system administration scripting languages.
  • Track record of active learning and creative problem solving.
  • Ability to work in a fast-paced environment and learn new skills quickly.

Nice-to-haves

  • Current TS/SCI with polygraph clearance.
  • Experience in direct support of military or intelligence community customers.
  • Experience supporting data team operations.
  • Interest in data science, data engineering, or machine learning.
  • Experience in system administration in an on-prem, air-gapped environment.
  • Experience configuring and optimizing storage solutions.
  • Experience configuring and optimizing networking solutions.
  • Experience deploying and maintaining Kubernetes applications.
  • Familiarity with IT automation tools like SaltStack, Ansible, Terraform.
  • Familiarity with virtualization and distributed file systems like Hadoop.
  • Familiarity with version control and program management technologies like git, svn, JIRA.
  • AWS professional certifications.
  • Familiarity with NVIDIA GPUs and appliances.

Benefits

  • Flexible schedule with occasional remote work options.
  • Competitive salary range from $87,100 to $157,450 per year.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service