GovCIO - Santa Fe, NM
posted about 2 months ago
GovCIO is hiring an ETL Engineer (or Data Scientist) to join our ETL Team, which ingests and visualizes data from across the cloud and alerts on deviations from the norm. The position is based in Hanover, MD, and is fully remote.

The role involves developing, inspecting, mining, transforming, and analyzing data to create descriptive and predictive models that impact productivity and decision-making and provide strategic mission impact. The ETL Engineer will apply data wrangling tools, including ETL and ELT tools and programming languages, to collect and blend data from operational and relevant external systems. The position requires applying data mining, machine learning, and statistical analysis to create predictive and descriptive models, and integrating these models to develop segmentation, clustering, forecasting, classification, and other models. Data visualization is also a key component: the engineer will use data discovery and visualization tools to interpret and present findings in a compelling and usable manner. The role includes maintaining and integrating analytical systems with operational systems, verifying the accuracy of the data and analytics, and interacting with both business and data Subject Matter Experts (SMEs).

The engineer will generate new business insights through the extraction, storage, transformation, analysis, and visualization of diverse data sets. This includes collecting and transforming structured, unstructured, relational, and NoSQL data using ETL and ELT tools, as well as developing custom code using programming languages. The role requires understanding and using distributed methods that scale to multi-terabyte data collections, analyzing data using the data mining, machine learning, and statistical algorithms available in Commercial Off-The-Shelf (COTS) tools, and building analytical solutions using programming languages and libraries. The ETL Engineer will interpret and evaluate the accuracy of results through iterative, agile methods, and will work closely with data SMEs, business stakeholders, and management to prioritize business and information needs.