GovCIO - Springfield, IL
posted about 2 months ago
GovCIO is currently seeking a Technical Data Scientist/ETL Engineer to join our ETL Team, which ingests and visualizes data from various cloud sources and alerts on deviations from the norm. This position is fully remote, with Hanover, MD as its base location. The successful candidate will develop, inspect, mine, transform, and analyze data to create descriptive and predictive models that improve productivity and decision-making and deliver strategic mission impact.
In this role, you will be responsible for data integration, applying data wrangling approaches such as ETL and ELT, along with programming languages, to collect and blend data from operational and relevant external systems. You will analyze data using data mining, machine learning, and statistical analysis to create predictive and descriptive models, including segmentation, clustering, forecasting, and classification. You will also use data visualization tools to interpret and present findings clearly, keep analytical systems maintained and integrated with operational systems, and verify the accuracy of data and analytics.
The position requires generating new business insights through the extraction, storage, transformation, analysis, and visualization of diverse data sets. You will collect and transform structured, unstructured, relational, and NoSQL data using ETL and ELT tools, and develop custom code where those tools fall short. An understanding of distributed methods, such as MapReduce, for scaling to multi-terabyte data collections is essential. You will analyze data using the data mining, machine learning, and statistical algorithms available in commercial off-the-shelf (COTS) tools, and build analytical solutions using programming languages and libraries.
Collaboration is key in this role, as you will work closely with data subject matter experts (SMEs), business stakeholders, and management to prioritize business and information needs. The ideal candidate will have a strong background in data science and engineering, with a focus on delivering actionable insights and data-driven solutions.