Oak Ridge National Laboratory - Oak Ridge, TN

posted about 2 months ago

Full-time - Mid Level
Oak Ridge, TN
Professional, Scientific, and Technical Services

About the position

We are hiring a Data Engineer for the Knowledge Discovery Infrastructure (KDI) program at Oak Ridge National Laboratory (ORNL). This role focuses on collaborating with sponsors and research users to deliver and maintain data pipelines, including ETL (Extract, Transform, Load) and reporting workflows, as well as managing data lifecycles. The KDI program is dedicated to facilitating research activities on data entrusted to ORNL by the U.S. Department of Veterans Affairs (VA) and other agencies. As a Data Engineer, you will have the opportunity to contribute to impactful research and development programs in healthcare informatics, bioinformatics, and computer science. This position is part of the Emerging Technologies & Computing group within the Research Computing division of the Information Technology Services Directorate at ORNL.

Our Data Engineers are expected to be proficient in a range of technologies, including Linux, SQL, Python, containers, Pandas, Spark, and source control, all within a high-security environment. You will be responsible for developing high-scale, robust data warehouses and data lakes, with a particular emphasis on state-of-the-art solutions for healthcare, life sciences, and genomics. In this role, you will work closely with research teams to understand their data needs and provide domain expertise regarding data models, query optimizations, and schema interpretation. You will design, build, and launch new data marts and ETL processes for major national research programs, as well as develop architectures for data intake, curation, organization, and dissemination in support of data science and related fields.

You will help deliver ORNL's mission by aligning your behaviors, priorities, and interactions with our core values of Impact, Integrity, Teamwork, Safety, and Service. Additionally, you will promote diversity, equity, inclusion, and accessibility by fostering a respectful workplace.

Responsibilities

  • Develop high-scale, robust data warehouses and data lakes focusing on healthcare, life sciences, and genomics.
  • Collaborate with research teams to understand data needs and provide domain expertise on data models, query optimizations, and schema interpretation.
  • Design, build, and launch new data marts and ETL processes for major national research programs.
  • Develop architectures for data intake, curation, organization, and dissemination in support of data science.
  • Design and develop high-performance database architectures.
  • Research and evaluate state-of-the-art data and information management technologies.
  • Work on various data assignments, collaborating with scientists and engineers to produce reliable data products and systems.
  • Align behaviors, priorities, and interactions with ORNL's core values and promote a respectful workplace.

Requirements

  • A BS in computer science, information technology, mathematics, engineering, or a related field of study and five (5) to seven (7) years of aligned experience are required.
  • Three (3) or more years of proven experience in database design and/or development.

Nice-to-haves

  • MS or Doctorate in computer science, information technology, mathematics, engineering, or a related field of study.
  • At least 3 years of proven experience in basic and advanced SQL, database programming, and scripting languages (Python preferred).
  • Experience with data warehousing systems and data-driven software development.
  • Solid understanding of database system administration.
  • Demonstrated expertise with OLTP and data warehousing; SQL Server is preferred.
  • Experience in designing analytics data systems.
  • Experience working with data processing frameworks and libraries such as Spark, Dask, and Pandas, and familiarity with NoSQL architectures.
  • Demonstrated experience with collecting, organizing, storing, and preparing data for analysis.
  • Experience with healthcare informatics is highly desired.
  • Working experience with, or a basic understanding of, AI/ML data requirements.

Benefits

  • Medical and retirement plans
  • Flexible work hours
  • On-site fitness, banking, and cafeteria facilities
  • Prescription Drug Plan
  • Dental Plan
  • Vision Plan
  • 401(k) Retirement Plan
  • Contributory Pension Plan
  • Life Insurance
  • Disability Benefits
  • Generous Vacation and Holidays
  • Parental Leave
  • Legal Insurance with Identity Theft Protection
  • Employee Assistance Plan
  • Flexible Spending Accounts
  • Health Savings Accounts
  • Wellness Programs
  • Educational Assistance
  • Relocation Assistance
  • Employee Discounts