GovCIO - Lansing, MI
posted about 2 months ago
GovCIO is currently hiring an ETL Engineer (or Data Scientist) to join our ETL Team, which focuses on ingesting and visualizing data from across the cloud and alerting on deviations from the norm. This position is based in Hanover, MD and is fully remote.
The role involves developing, inspecting, mining, transforming, and analyzing data to create descriptive and predictive models that improve productivity, inform decision making, and provide strategic mission impact. The candidate will apply data wrangling tools, including ETL, ELT, and programming languages, to collect and blend data from operational and relevant external systems. The individual will be responsible for applying data mining, machine learning, and statistical analysis to create predictive and descriptive models, including segmentation, clustering, forecasting, classification, and other models. The role also requires applying data discovery and data visualization tools to interpret and present findings in a compelling and usable manner. The candidate will maintain and integrate analytical systems with operational systems, verifying the accuracy of the data and analytics while interacting with both business and data Subject Matter Experts (SMEs).
The ETL Engineer/Data Scientist will generate new business insights through extraction, storage, transformation, analysis, and visualization of diverse data sets. This includes collecting and transforming structured, unstructured, relational, and NoSQL data using ETL and ELT tools, as well as developing custom code in programming languages. The candidate should understand and use distributed methods that scale to multi-terabyte data collections. Additionally, the role involves analyzing data using the data mining, machine learning, and statistical algorithms available in Commercial Off-The-Shelf (COTS) tools, and building analytical solutions using programming languages and libraries.
The individual will interpret and evaluate the accuracy of results through iterative, agile methods, and will work closely with data SMEs, business stakeholders, and management to prioritize business and information needs.