Senior Data Engineer

$165,000 - $200,000/Yr

ZipRecruiter - Englewood, CO

posted 23 days ago

Full-time - Senior
Hybrid - Englewood, CO
51-100 employees

About the position

The Senior Data Engineer at Grey Matters Defense Solutions will play a crucial role in the data science group, focusing on building and maintaining scalable data pipelines, managing metadata storage, and deploying machine learning models. This position requires collaboration with data scientists and machine learning engineers to ensure efficient data processing and integration, ultimately supporting innovative solutions for the defense and intelligence sectors.

Responsibilities

  • Build and maintain data pipelines for ingestion, conversion, and processing of raw data for machine learning models.
  • Implement tagging and annotation tools to enhance data for model training, ensuring efficient metadata storage and retrieval.
  • Design and maintain databases for structured and unstructured data, including tags and metadata.
  • Automate data preprocessing tasks such as cleaning, normalization, and augmentation for neural network models.
  • Integrate data pipelines with machine learning workflows, collaborating with machine learning engineers on model training, deployment, and monitoring processes.
  • Set up and manage infrastructure for deploying machine learning models, including maintaining inference servers and continuous integration pipelines.
  • Implement model version control and management systems to track experiments and ensure smooth transitions between model iterations.

Requirements

  • Eight (8) years of experience in a Data Engineer role.
  • Active TS (Top Secret) clearance.
  • Experience with machine learning workflows, including data pipelines and model deployment.
  • Familiarity with structured and unstructured data, including converting both for use in machine learning models.
  • Strong understanding of MLOps practices, including model versioning, monitoring, and CI/CD for machine learning models.
  • Experience in scaling infrastructure to handle large datasets and multiple models in production.

Nice-to-haves

  • Experience with MLOps tools and platforms like Kubeflow, MLflow, or Seldon.
  • Proficiency in data engineering tools such as Airflow, Bash, Docker, Docker Compose, GDAL, Git, Linux, make, MongoDB.
  • Expertise in NVIDIA installations (CUDA, cuDNN, drivers, NVIDIA Container Toolkit).
  • Strong experience with data conversion libraries such as Rasterio and Ray.
  • Experience with web and API development using Traefik and hosting tools.

Benefits

  • Medical insurance
  • Dental insurance
  • Vision insurance
  • Life insurance
  • Short-term disability insurance
  • Long-term disability insurance
  • SEP IRA: 25% of base salary
  • PTO: six weeks
  • Employee assistance program
  • Employee discount
  • Flexible spending account
  • Health savings account
  • Referral program