ManTech - Ashburn, VA

posted 4 days ago

Full-time
Ashburn, VA
Professional, Scientific, and Technical Services

About the position

The SME Data Engineer at ManTech will play a crucial role in developing complex data analytical solutions for U.S. Customs and Border Protection (CBP). This position involves working with large datasets to support law enforcement personnel in assessing potential threats entering the country. The engineer will collaborate with government clients and technical teams to optimize data workflows and implement data migration strategies, ensuring the integrity and accessibility of data for analysis and reporting.

Responsibilities

  • Develop large volume data sets sourced from relational tables for data analysts and scientists.
  • Assist in creating, managing, and optimizing scheduled ETL jobs and workflows.
  • Conduct data analysis and problem-solving for large datasets used in various analytical products.
  • Implement data migration/pipelines from on-prem to cloud/non-relational storage platforms.
  • Respond to data queries and analysis requests from various organizational groups.
  • Create and publish regularly scheduled and ad hoc reports as needed.
  • Research and document data definitions and provenance for primary data sets supporting core business applications.
  • Manage data engineering source code control using GitLab.

Requirements

  • Experience with relational databases and knowledge of query tools and/or BI tools like Power BI or OBIEE.
  • Experience with the Hadoop ecosystem, including HDFS, YARN, Hive, Pig, and distributed processing methods such as Spark, Kafka, or Storm.
  • Strong experience in automating ETL jobs via UNIX/LINUX shell scripts and CRON jobs.
  • Practical understanding of data warehousing from a production relational database environment.
  • Strong experience using analytic functions within Oracle or similar tools in non-relational database systems.
  • Experience with Atlassian suite of tools such as Jira and Confluence.
  • Knowledge of Continuous Integration & Continuous Development tools (CI/CD).
  • Ability to multitask efficiently and work in a dynamic data environment.
  • Excellent verbal/written communication and problem-solving skills.

Nice-to-haves

  • 5 years of experience in developing, maintaining, and optimizing complex Oracle PL/SQL packages.
  • 10 years of experience in large complex data warehousing environments (80 TB).
  • Experience with relational database systems such as Oracle, MySQL, Postgres, SQL Server, with emphasis on Oracle.
  • Experience in architecting data engineering pipelines/data lakes within cloud services (AWS, GCP).
  • Experience with Amazon S3, Redshift, EMR, and Scala.
  • Experience with migrating on-prem legacy database objects to Amazon S3 cloud environment.
  • Strong experience in converting JSON documents to targets such as Parquet, Postgres, and Redshift.
  • Familiarity with data science/machine learning development for structured and unstructured datasets.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service