Harbinger Health - Cambridge, MA

posted 3 months ago

Full-time
Cambridge, MA

About the position

As the Data Architecture Lead at Harbinger Health, you will play a pivotal role in shaping the data framework and assets for our digital cloud foundation on AWS. This position requires close collaboration with the Cloud Architect to align data architectures with our enterprise, scientific, and business strategies while enforcing standards for data solutions. You will be responsible for ensuring that data systems and applications meet service level targets (SLTs) and adhere to strict information security and privacy standards.

Reporting to the Senior Director / Head of IT, you will work alongside Data Science, Research, Clinical, and TechOps teams to spearhead the architecture design and deployment of data products that support the goals of Data Science, Bioinformatics, and Machine Learning. Additionally, you will provide data marts and analytical tools to support other business operations.

In this role, you will develop and manage a comprehensive vision for data architecture using platforms like Snowflake and AWS, which includes the design of data structures and business transformation logic. You will analyze business requirements to create conceptual and detailed data models, blueprints, and roadmaps that enable advanced data products. Ensuring adherence to data quality standards and compliance with health and life science regulatory requirements (such as HIPAA, including the protection of PHI and PII) will be a critical part of your responsibilities. You will design logical data models based on existing applications and databases and transform these into physical configurations. You will collaborate with vendors and team members to establish and maintain data structures, and you will maintain and evolve data architectures to incorporate the latest cloud and big data technologies.

Your role will also involve developing proofs of concept to validate new data-driven solutions, as well as building and governing on-prem and cloud-native databases, operational data stores, data marts, and data lakes. You will work with BI and Analytics teams to create optimized, reusable semantic models, ensuring metadata and lineage clarity. You will set standards for data modeling and design, promote best practices across the company, and mentor operational and business teams on data architecture requirements and designs. Upholding metadata standards to ensure clarity in data models, structures, and semantics will also be part of your duties.

Responsibilities

  • Develop and manage a comprehensive vision for data architecture using platforms like Snowflake and AWS, including the design of data structures and business transformation logic.
  • Analyze business requirements to create conceptual and detailed data models, blueprints, and roadmaps that enable advanced data products.
  • Ensure adherence to data quality standards and compliance with health and life science regulatory requirements (such as HIPAA, including the protection of PHI and PII).
  • Design logical data models based on existing applications and databases and transform these into physical configurations.
  • Collaborate with vendors and team members to establish and maintain data structures.
  • Maintain and evolve data architectures to incorporate the latest cloud and big data technologies.
  • Develop proofs of concept to validate new data-driven solutions.
  • Build and govern on-prem and cloud-native databases, operational data stores, data marts, and data lakes.
  • Work with BI and Analytics teams to create optimized, reusable semantic models, ensuring metadata and lineage clarity.
  • Set standards for data modeling and design, promoting best practices across the company.
  • Mentor operational and business teams on data architecture requirements and designs.
  • Uphold metadata standards to ensure clarity in data models, structures, and semantics.

Requirements

  • BS or MS in Computer Science, Life Science, or a related scientific field.
  • 8-10+ years' industry experience in data architecture, data management, or as a Lead Data Engineer.
  • 3+ years' experience in the Biotech/Life Sciences industry.
  • Extensive knowledge of big data technologies and experience in modern data architecture designs such as data lakes, data meshes, and data fabrics.
  • Proficiency in object-oriented programming languages like Scala, Java, and Python, and expertise in various database types (transactional, object, document, and graph).
  • Experience in managing implementation and service vendors.
  • Familiarity with enterprise data governance programs.
  • Comprehensive experience with cloud technologies, APIs, backup solutions, Disaster Recovery, and IT Ops.
  • Skills in designing and building data processing pipelines using tools like Nextflow or Airflow.
  • An understanding of the Machine Learning lifecycle and its applications.
  • Strong analytical, problem-solving, and documentation skills, with the ability to handle complex challenges independently.

Nice-to-haves

  • Experience working in the diagnostics industry.
  • Experience working within CLIA/CAP laboratories and/or with FDA-regulated products.