Virginia Tech - Blacksburg, VA

posted 29 days ago

Full-time - Mid Level
Blacksburg, VA
10,001+ employees
Educational Services

About the position

The Data Architect position at Virginia Tech involves joining the Data and Analytics Infrastructure team to support the office of Analytics and Institutional Effectiveness (A&IE). The role focuses on developing a robust data strategy to ensure high data quality, effective data storage, and well-defined data governance, contributing to the mission of providing insights and intelligence derived from institutional data.

Responsibilities

  • Collaborate with data scientists to understand the data and define best practices for processing, cataloging, and analyzing the data
  • Manage, troubleshoot, and optimize extract, transform, and load (ETL) workflows
  • Design and implement a data lineage solution to track the lifecycle of the data
  • Review and implement policies to ensure the privacy and security of the data
  • Ensure data is processed and stored in a way to support various use cases, including AI applications
  • Management and design of databases

Requirements

  • Bachelor's degree in computer science, data science or a related field
  • Several years of professional experience as a data engineer or data architect
  • Experience designing and managing large ETL workflows
  • Proficiency in Python and data science packages such as Pandas and Numpy
  • Good understanding of multiple types of databases: relational, vector, graph, document
  • Good understanding of various types of data storage methods
  • Experience preparing and optimizing data for generative AI applications
  • Experience with data lineage tools and frameworks
  • Passion for implementing industry standards and best practices
  • Effective verbal and written communication skills

Nice-to-haves

  • Willingness to learn and experiment with new technologies
  • Strong experience with Apache Airflow
  • Experience in the AWS ecosystem
  • Strong knowledge of data processing optimization tools and techniques
  • Experience administering PostgreSQL relational databases
  • Ability to build insights and visualizations based on business questions and available data

Benefits

  • Professional development opportunities
  • Inclusive community
  • Diversity and inclusion initiatives
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service