Lantern Pharma - Plano, TX

posted about 2 months ago

Full-time - Mid Level
Plano, TX
Professional, Scientific, and Technical Services

About the position

Lantern Pharma is seeking a talented and highly motivated Lead Data/Infrastructure Engineer to build and enhance our RADR AI platform that can predict drug response, therapeutic benefit, and survival outcome of cancer patients. In this role, you will work directly with our RADR team to take ownership of developing the engineering side of our platform, focusing on creating a robust backend data infrastructure that supports both Lantern's internal AI scientists and external-facing partnerships. This position offers the opportunity to work with cutting-edge data tools and be part of a team at the forefront of AI-driven oncology.

The ideal candidate will possess an entrepreneurial mindset, be comfortable with creative problem-solving, and be excited to build engineering infrastructure from the ground up in a flexible, fast-paced environment. You will play a crucial role in the evolution of Lantern's science and platform, providing opportunities for growth and significant impact within the organization. Your contributions will directly influence the development of precision oncology solutions that can transform drug development processes.

As a Lead Data/Infrastructure Engineer, you will help set the company's overall engineering strategy and growth, which includes developing technical infrastructure, assessing security, and building the engineering team(s). You will adopt a player/coach mentality, working hands-on to build out data pipelines and infrastructure while also managing a team of engineers to support overall data engineering and infrastructure goals. Your responsibilities will include taking ownership of supporting, architecting, and building a system for efficient data ingress/ETL pipelines for both existing data assets and newly identified data sources. Additionally, you will manage engineering activities with cloud systems (AWS), including data transfers, budget estimates, and engaging with external partners such as cloud providers.

You will collaborate with an interdisciplinary team of computational biologists and machine learning scientists across multiple projects, including data ingest jobs, MLOps workflows, and project architecture. Furthermore, you will identify and track key performance indicators to be shared both internally and with company stakeholders, and initiate the incorporation of emerging technology and open-source projects to drive overall engineering goals.

Responsibilities

  • Help set the company's overall engineering strategy and growth, including developing technical infrastructure, assessing security, and building the engineering team(s)
  • Work hands-on to build out data pipelines and infrastructure while managing a team of engineers to support overall data engineering and infrastructure goals
  • Take ownership of supporting, architecting, and building a system for efficient data ingress/ETL pipelines for both existing data assets and newly identified data sources
  • Manage engineering activities with cloud systems (AWS), including data transfers, budget estimates, and engagement with external partners (cloud providers, etc.)
  • Work with an interdisciplinary team of computational biologists and machine learning scientists across multiple projects, including data ingest jobs, MLOps workflows, and project architecture
  • Identify and track key performance indicators to be shared both internally and with company stakeholders
  • Initiate the incorporation of emerging technology and open-source projects to drive overall engineering goals

Requirements

  • 5-7 years of experience working in engineering/development environments
  • Experience building optimized and scalable data infrastructure using open-source and cloud-native technologies
  • Understanding of modern engineering design principles (distributed systems, stateless processes, etc.)
  • Experience building ETL pipelines in a cloud-native environment, with bonus points for Luigi, Airflow, or AWS Glue
  • Proficient working knowledge of relational and non-relational databases
  • Proficient working knowledge of AWS
  • Experience using GenAI tools for automated data meta-tagging and querying
  • Working knowledge of scripting languages, Python strongly preferred
  • Experience with version control and Agile
  • Strong leadership and communication skills
  • Flexibility in work environment, with time onsite at the Lantern office in Plano

Nice-to-haves

  • Experience working with biological data or at a biopharmaceutical company
  • Experience using CodeOcean
  • Experience with Data Lakehouse architectures
  • Experience in data science and AI

Benefits

  • Competitive health, dental & vision insurance
  • Stock options in a public company
  • Opportunity to take leadership on new and meaningful projects
  • Involvement with leading conferences & industry trade shows