Data Engineer, Paydarfar Lab

$80,000 - $80,000/Yr

The University of Texas System - Austin, TX

posted 2 months ago

Part-time - Mid Level
Austin, TX
Educational Services

About the position

The Department of Neurology at the Dell Medical School is seeking a Data Engineer for the Paydarfar Lab. This position is a one-year fixed-term appointment that is renewable based on funding, performance, and business needs. The primary purpose of this role is to develop and maintain a data platform that captures, analyzes, and visualizes various data streams related to cognitive disease states. The Data Engineer will be responsible for creating a robust, secure, HIPAA-compliant, cloud-based data warehouse infrastructure that can ingest data from multiple sources, including wearable sensor data, hospital monitoring systems, electronic medical records (EMR), and cognitive and behavioral assessments. In this role, the Data Engineer will develop extract, transform, and load (ETL) procedures to capture time-synchronized data streams from different sources. They will design and automate ETL solutions for data cleansing, preparation, validation, and related processes. The engineer will also construct warehouse infrastructure to facilitate automated feature extraction and retrospective population-based analysis. As the data landscape evolves, the engineer will adapt ETL procedures to meet changing data models and business needs, ensuring data quality and integrity. Collaboration is key in this position, as the Data Engineer will partner with clinicians, cognitive and behavioral therapists, physiologists, data modelers, and other team members. They will provide documentation for institutional knowledge retention, maintain timelines in coordination with principal investigators (PIs) and core leads, and assist in training project staff and research assistants involved in data collection. The role may also involve facilitating project collaborations and teamwork, along with other related duties as assigned.

Responsibilities

  • Develop and implement a robust, secure, HIPAA-compliant, cloud-based data warehouse infrastructure capable of ingesting data streams from various sources.
  • Develop extract, transform, and load (ETL) procedures to capture time-synchronized streams from different sources, including wearable devices and hospital EMR systems.
  • Design and automate ETL solutions for data cleansing, preparation, data validation, and related processes.
  • Construct warehouse infrastructure to allow for automated feature extraction and population-based analysis retrospectively.
  • Adapt ETL procedures to evolving data models and business needs.
  • Adapt ETL procedures in response to data quality assurance findings.
  • Partner with key stakeholders, including clinicians, cognitive and behavioral therapists, physiologists, data modelers, and other teammates.
  • Provide documentation for institutional knowledge retention.
  • Develop and maintain timelines in coordination with PIs and Core leads.
  • Collaborate with PIs and project investigators to prepare methodological notes, reports, presentations, and other dissemination activities related to the project activities.
  • Assist in training project staff and research assistants who may be helping with data collection.
  • Help to facilitate project collaborations and teamwork.
  • Perform other related duties as assigned.

Requirements

  • BS degree in Computer Science, Information Systems, Engineering, or equivalent professional experience.
  • Two years of experience designing and implementing data warehouse infrastructure and ETL solutions for complete data models using multiple data sources.
  • Two years of experience with Python, SQL, or equivalent programming languages.
  • Strong communication and collaboration skills and good documentation habits.
  • Demonstrated experience with orchestration and automation of database processes.
  • Strong aptitude for troubleshooting and problem solving.
  • Team player, comfortable communicating cross-functionally and across management levels.
  • Self-motivated and able to organize work independently in a rapidly changing environment.

Nice-to-haves

  • Graduate degree in computer science or related field with five years of professional experience.
  • Some experience with life sciences or medical systems.
  • Three years of experience working in a cloud-based data warehouse environment.
  • Experience with multiple types of databases.
  • Experience with time-series data structures and analytics.
  • Experience with ETL procedures and mobile/wearable devices.
  • Cloud platform certification (e.g., AWS).
  • Experience with issue tracking systems (e.g., JIRA).

Benefits

  • Competitive salaries
  • Full benefits
  • Extensive support network
  • Collaborative working community
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service