GovCIO - Boise, ID
posted about 2 months ago
GovCIO is currently hiring an ETL Engineer (or Data Scientist) to join our ETL Team, which focuses on ingesting and visualizing data from across the cloud and alerting on deviations from the norm. This position is based in Hanover, MD, and is fully remote.
The successful candidate will be responsible for developing, inspecting, mining, transforming, and analyzing data to create descriptive and predictive models that impact productivity and decision making and provide strategic mission impact. The role involves applying data wrangling tools, including ETL, ELT, and programming languages, to collect and blend data from operational and relevant external systems. The candidate will also apply data mining, machine learning, and statistical analysis to create predictive and descriptive models, and will integrate these to develop segmentation, clustering, forecasting, classification, and other models. Data visualization is a key component of this role: the candidate will apply data discovery and visualization tools to interpret and present findings in a compelling and usable manner. Maintaining analytical systems and integrating them with operational systems, verifying the accuracy of the data and analytics, and interacting with both business and data SMEs are also essential responsibilities.
The ETL Engineer/Data Scientist will generate new business insights through extraction, storage, transformation, analysis, and visualization of diverse data sets. This includes collecting and transforming structured, unstructured, relational, and NoSQL data using ETL and ELT tools, as well as developing custom code in programming languages. The candidate should understand and use distributed methods that scale to multi-terabyte data collections.
The job also involves analyzing data using the data mining, machine learning, and statistical algorithms available in COTS tools, and building analytical solutions using programming languages and libraries. The candidate will interpret and evaluate the accuracy of results through iterative, agile methods and will work closely with data SMEs, business stakeholders, and management to prioritize business and information needs.