GovCIO - Boston, MA
posted about 2 months ago
GovCIO is currently seeking an ETL Engineer (or Data Scientist) to join our ETL Team, which is dedicated to ingesting and visualizing data from various cloud sources while alerting on deviations from normalization. This position is fully remote and will be based in Hanover, MD. The successful candidate will play a crucial role in developing, inspecting, mining, transforming, and analyzing data to create descriptive and predictive models that significantly impact productivity and decision-making, ultimately providing strategic mission impact. In this role, you will be responsible for data integration, applying data wrangling tools such as ETL and ELT, as well as programming languages to collect and blend data from operational and relevant external systems. You will conduct data analysis using data mining, machine learning, and statistical analysis to create predictive and descriptive models, which will be integrated into various applications. Your work will also involve data visualization, where you will utilize data discovery and visualization tools to interpret and present findings in a compelling and usable manner. Additionally, you will maintain and integrate analytical systems with operational systems, ensuring the accuracy of data and analytics while collaborating with both business and data subject matter experts (SMEs). The position requires generating new business insights through the extraction, storage, transformation, analysis, and visualization of diverse data sets. You will collect and transform structured, unstructured, relational, and NoSQL data using ETL and ELT tools, and develop custom code using programming languages. Understanding and utilizing distributed methods, such as MapReduce, to scale to multi-Terabyte data collections is essential. You will analyze data using various data mining and machine learning algorithms available in commercial off-the-shelf (COTS) tools and build analytical solutions using programming languages and libraries. Your role will also involve interpreting and evaluating the accuracy of results through iterative, agile methods, and applying data discovery and visualization tools to develop actionable data stories. Close collaboration with data SMEs, business stakeholders, and management will be necessary to prioritize business and information needs effectively.