GovCIO - Saint Paul, MN
posted about 2 months ago
GovCIO is currently hiring an ETL Engineer (or Data Scientist) to join our ETL Team, which focuses on ingesting and visualizing data from across the cloud and alerting on deviations from the norm. The position is based in Hanover, MD, and is fully remote.
The role involves developing, inspecting, mining, transforming, and analyzing data to create descriptive and predictive models that affect productivity and decision-making and deliver strategic mission impact. The ETL Engineer will apply data wrangling tools, including ETL, ELT, and programming languages, to collect and blend data from operational and relevant external systems. The position requires applying data mining, machine learning, and statistical analysis to create predictive and descriptive models. The candidate will also apply and integrate these techniques to develop segmentation, clustering, forecasting, classification, and other models. Data visualization is a key component of this role: the engineer will use data discovery and visualization tools to interpret and present findings in a compelling and usable manner.
Additionally, the ETL Engineer will maintain and integrate analytical systems with operational systems, verify the accuracy of the data and analytics, and interact with both business and data Subject Matter Experts (SMEs). The role also involves generating new business insights through the extraction, storage, transformation, analysis, and visualization of diverse data sets. The candidate will collect and transform structured, unstructured, relational, and NoSQL data using ETL and ELT tools, and will develop custom code using programming languages. Understanding and using distributed methods that scale to multi-terabyte data collections is essential.
The engineer will analyze data using the data mining, machine learning, and statistical algorithms available in Commercial Off-The-Shelf (COTS) tools, and build analytical solutions using programming languages and libraries. The position requires interpreting results and evaluating their accuracy through iterative, agile methods, and working closely with data SMEs, business stakeholders, and management to prioritize business and information needs.