Oak Ridge National Laboratory - Oak Ridge, TN
posted about 2 months ago
We are hiring a Data Engineer for the Knowledge Discovery Infrastructure (KDI) program at Oak Ridge National Laboratory (ORNL). This role focuses on collaborating with sponsors and research users to deliver and maintain data pipelines, including ETL (Extract, Transform, Load) and reporting workflows, as well as managing data lifecycles. The KDI program is dedicated to facilitating research activities on data entrusted to ORNL by the U.S. Department of Veteran's Affairs (VA) and other agencies. As a Data Engineer, you will have the opportunity to contribute to impactful research and development programs in healthcare informatics, bioinformatics, and computer science. This position is part of the Emerging Technologies & Computing group within the Research Computing division of the Information Technology Services Directorate at ORNL. Our Data Engineers are expected to be proficient in various technologies, including Linux, SQL, Python, containers, Pandas, Spark, and source control, all within a high-security environment. You will be responsible for developing high-scale, robust data warehouses and data lakes, with a particular emphasis on state-of-the-art solutions for healthcare, life sciences, and genomics. In this role, you will work closely with research teams to understand their data needs and provide domain expertise regarding data models, query optimizations, and schema interpretation. You will design, build, and launch new data marts for major national research programs and ETL processes, as well as develop architectures for data intake, curation, organization, and dissemination in support of data science and related fields. Your contributions will help ensure that ORNL's mission is met by aligning behaviors, priorities, and interactions with our core values of Impact, Integrity, Teamwork, Safety, and Service. Additionally, you will promote diversity, equity, inclusion, and accessibility by fostering a respectful workplace.