Healthcare Cloud Data Engineer

$83,200 - $104,000/Yr

The Judge Group - Los Angeles, CA

posted 2 months ago

Full-time
Remote - Los Angeles, CA
Administrative and Support Services

About the position

As a Healthcare Cloud Data Engineer, you will be responsible for developing and maintaining scalable data pipelines, data lakes, and warehouses within cloud environments. Your primary focus will be on securely and efficiently handling healthcare data. You will work extensively with tools like Azure Data Factory (ADF) for orchestrating data workflows and Databricks for big data processing and analytics. Your role will ensure that the healthcare organization's data architecture supports real-time data processing, machine learning models, and business intelligence (BI) tools, while ensuring compliance with healthcare regulations. In this position, you will design and implement cloud-based data infrastructure tailored to healthcare organizations, focusing on scalability, security, and performance. You will build and maintain data lakes and data warehouses for healthcare data, ensuring support for both structured and unstructured data. Additionally, you will develop data pipelines using Azure Data Factory (ADF) for ingesting, transforming, and loading (ETL) data from various sources such as Payer modules, Electronic Health Records (EHR), clinical systems, and external healthcare data sources. You will also implement Databricks for large-scale data processing, data engineering, and machine learning workloads, enabling real-time data analytics and advanced insights. Your responsibilities will include building and maintaining automated data pipelines using ADF for moving and transforming healthcare data between cloud storage, databases, and analytics platforms. You will monitor, optimize, and troubleshoot data pipelines for performance, reliability, and scalability, ensuring that the data architecture meets the needs of the organization.

Responsibilities

  • Design and implement cloud-based data infrastructure tailored to healthcare organizations, focusing on scalability, security, and performance.
  • Build and maintain data lakes and data warehouses for healthcare data, ensuring support for both structured and unstructured data.
  • Develop data pipelines using Azure Data Factory (ADF) for ingesting, transforming, and loading (ETL) data from various sources such as Payer modules, Electronic Health Records (EHR), clinical systems, and external healthcare data sources.
  • Implement Databricks for large-scale data processing, data engineering, and machine learning workloads, enabling real-time data analytics and advanced insights.
  • Build and maintain automated data pipelines using ADF for moving and transforming healthcare data between cloud storage, databases, and analytics platforms.
  • Use Databricks for processing large volumes of healthcare data, including running distributed processing jobs, cleaning, transforming, and enriching data.
  • Monitor, optimize, and troubleshoot data pipelines for performance, reliability, and scalability.
  • Monitor data pipelines and cloud infrastructure for performance bottlenecks, resource usage, and data errors.
  • Optimize workflows, database queries, and big data jobs in Databricks and ADF to ensure efficient system performance with minimal downtime.
  • Use monitoring and alerting tools to ensure data pipelines are running as expected and address issues proactively.

Requirements

  • Bachelor's or Master's degree in Computer Science, Data Engineering, Information Systems, or a related field.
  • 7+ years of proven experience with Azure Data Factory and Databricks for data processing and orchestration in a healthcare context.
  • 5+ years of experience with cloud-based big data platforms like Databricks, and expertise in distributed computing frameworks such as Apache Spark.
  • 7+ years of proficiency in SQL, Python, and Scala for data manipulation and pipeline development.
  • Knowledge of cloud security practices, including encryption, access controls, and auditing in a healthcare environment.
  • Strong problem-solving, analytical thinking, and communication skills.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service