Rutgers-The State University - New Brunswick, NJ

posted 3 months ago

Full-time - Mid Level
New Brunswick, NJ
Educational Services

About the position

Rutgers, The State University of New Jersey, is seeking a Data Engineer (Biomedical & Clinical Informatics) for the Office of Advanced Research Computing (OARC). This position is integral to supporting the Rutgers University biomedical research community and other research disciplines in their computing and data-related research. The successful candidate will collaborate with research scientists and biomedical informatics specialists to manage and analyze data generated in molecular and cellular biology, genomics, and biomedicine. This will involve utilizing existing data science and machine learning tools, as well as engineering new toolsets that include models, algorithms, and software developed in-house. The Data Engineer will report to the Director of Research Support under the Associate Vice President for Advanced Research Computing. A significant aspect of this role includes engaging in outreach across Rutgers campuses to identify potential new users of research computing and data resources. The incumbent will work closely with faculty, post-doctoral associates, students, and research staff to understand their research needs, allowing for the customization of workflows, training, and support. There is also an opportunity to grow a biomedical and clinical informatics team based on the demand for these services. Key responsibilities include participating in the development of new biomedical and clinical informatics research and support models, potentially leading to the creation of a core facility or center of excellence. The Data Engineer will develop and maintain data pipelines to facilitate the smooth flow and collection of clinical research data, integrating various sources for comprehensive analysis. They will utilize expertise in data wrangling techniques to clean, transform, and prepare raw data for analysis, ensuring data quality and consistency. Additionally, the role involves providing education, outreach, and training to the university community, optimizing data systems for performance, and collaborating closely with Enterprise Infrastructure teams to align data engineering efforts with broader infrastructure strategies.

Responsibilities

  • Support the Rutgers University biomedical research community in their research computing and data-related research.
  • Manage and analyze data in molecular and cellular biology, genomics, and biomedicine.
  • Develop and maintain data pipelines for clinical research data integration and analysis.
  • Utilize data wrangling techniques to clean, transform, and prepare raw data for analysis.
  • Provide education, outreach, and training to the university community on advanced computing.
  • Optimize data systems for performance, ensuring efficient storage and retrieval of data.
  • Collaborate with Enterprise Infrastructure teams to align data engineering efforts with broader strategies.
  • Participate in professional development activities and present at national meetings.
  • Develop and expand internal and external partnerships to promote advanced computing-related collaborations.

Requirements

  • Master's degree required plus a minimum of six (6) years' experience supporting computationally intensive and/or data intensive research projects or equivalent work.
  • Strong verbal and written communication skills.
  • Proven experience in data pipeline building, optimization, and data system architecture within a healthcare or clinical research environment.
  • Expertise in data wrangling techniques, ETL processes, and staging data for analysis.
  • Proficiency in programming languages (e.g., Python, SQL, Java) and experience with database technologies (e.g., SQL Server, PostgreSQL).
  • Deep knowledge of statistical methods and practical knowledge of machine learning model building and deployment.
  • Experience working with Enterprise Infrastructure teams to align data engineering efforts with broader infrastructure strategies.

Nice-to-haves

  • A Ph.D. in a related field.
  • Knowledge of the EPIC electronic medical records system and the EPIC Cognitive Computing Platform (ECCP).
  • Familiarity with data-intensive computing environments.
  • Experience with cloud-based data technologies and distributed computing frameworks (e.g., AWS, Azure, Hadoop, Spark).
  • Experience with statistical tools like SAS, Stata, and SPSS.

Benefits

  • Comprehensive benefit program including health insurance, retirement plans, and paid time off.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service