Somatus - McLean, VA

posted 2 months ago

Full-time - Entry Level
McLean, VA
1,001-5,000 employees
Ambulatory Health Care Services

About the position

As the largest and leading value-based kidney care company, Somatus is dedicated to empowering patients living with chronic kidney disease to experience healthier lives at home and fewer days in the hospital. The Data Engineer QA role is crucial to achieving this mission by developing, testing, and maintaining ETL solutions that drive clinical operations, advanced analytics, and machine learning models. This position requires collaboration with various teams, including clinical, operational, and finance, to integrate data from diverse sources for internal and external stakeholders.

The Data Engineer QA will ensure that the data processed into the data warehouse is of high quality and reliability, avoiding functional and regression defects. This role involves working closely with cross-functional teams, including product managers, data engineers, BI developers, and data scientists, to identify and resolve issues in data-driven processes. The successful candidate will use tools like JIRA and Azure DevOps to manage work and test execution, create clear defect details, and implement automated testing solutions to ensure the accuracy of data analytics and ETL pipelines.

In addition to technical responsibilities, the Data Engineer QA will contribute to the planning process for workstreams, including inception, requirements gathering, technical design, development, testing, and delivery of ETL solutions. The role also requires adherence to security guidelines to safeguard Protected Health Information (PHI) and establish secure communication channels for data transfer. Agile methodologies will be applied throughout the development process, enhancing collaboration and transparency.

This position is not just about technical skills; it embodies the values of Somatus, including authenticity, collaboration, empowerment, innovation, and tenacity. The ideal candidate will thrive in an inclusive work environment that promotes growth and development, contributing to the overall mission of creating More Lives, Better Lived.

Responsibilities

  • Ensure that data processed into our data warehouse is of high quality and reliability, avoiding functional and regression defects.
  • Collaborate closely with cross-functional teams, including product managers, data engineers, BI developers, and data scientists, to identify, report, and help resolve issues in our data-driven processes.
  • Use JIRA/Azure DevOps to manage work, test cases, and test execution.
  • Create clear and concise defect details describing actual versus expected behaviors.
  • Implement automated testing solutions, ensuring the accuracy of Data Analytics and ETL pipelines.
  • Take responsibility for the technical design, development, testing, maintenance, and optimization of cloud-native data and ETL solutions.
  • Contribute to workstream planning processes including inception, requirements gathering, technical design, development, testing, and delivery of ETL solutions.
  • Collaborate with Analytics & Reporting, Data Science, Machine Learning, Analytics Engineering, IT Infrastructure, and other Technology teams in solution design, development, and deployment.
  • Implement and adhere to security guidelines to safeguard Protected Health Information (PHI) and establish secure communication channels for data transfer.
  • Apply Agile methodologies throughout the development process, using tools such as JIRA, Confluence, and video communication to enhance collaboration, documentation, and transparency.
  • Follow DevOps/DataOps best practices throughout the Software Development Life Cycle (SDLC).

Requirements

  • Bachelor's degree in Computer Science, Information Technology, Engineering, Mathematics, or equivalent.
  • 2+ years of professional experience in a Data Engineering (or similar) role.
  • Experience in designing and implementing data applications and data architectures.
  • Experience with open-source data frameworks like Spark and/or experience with cloud data platforms is preferred.
  • Healthcare experience in a Payer or Provider/Hospital Organization preferred.
  • Experience in Azure data technologies is a bonus (Azure Data Factory, Synapse, Cosmos DB, Azure SQL).
  • Experience with DevOps/DataOps practices is a bonus.
  • Experience or familiarity with Agile or a similar process.
  • 5+ years of experience with at least one database/data warehouse solution (e.g., MySQL, MSSQL, Synapse, Snowflake, Redshift).
  • 2+ years of experience in at least one programming language (preferably Python).
  • Experience using industry-standard Python libraries for data exploration, analysis, and transformation (e.g., Pandas, NumPy).
  • Experience using REST APIs.
  • Proficiency in writing SQL queries, views, stored procedures, etc.
  • Experience working with data housed in file formats including TXT, CSV, JSON, YAML, Parquet, XLSX.
  • Problem-solving aptitude and critical thinking skills.
  • Excellent communication and presentation skills.

Nice-to-haves

  • Experience with cloud-native data solutions.
  • Familiarity with machine learning models and analytics engineering.

Benefits

  • Subsidized personal healthcare coverage (medical, dental, vision)
  • Flexible PTO
  • Professional Development, CEU, and Tuition Reimbursement
  • Curated Wellness Benefits supporting teammates' physical and mental well-being
  • Community engagement opportunities