Data Architect

$101,400 - $183,300/Yr

Leidos - Atlanta, GA

posted 4 months ago

Full-time - Mid Level
Atlanta, GA
Professional, Scientific, and Technical Services

About the position

Health Mission Solutions is seeking a Data Architect, contingent upon contract award. The Data Architect will work closely with the Program Manager, Cloud Solutions Architect, data engineers, and data analysts to support multiple public health project teams across the full data management cycle: data ingestion, cleansing, transformation, security, exploration, and visualization using Microsoft Azure tools and technologies such as Databricks, Spark Streaming, Azure SQL, Delta Lake, Azure Data Factory, HDInsight, and notebooks. This position requires hands-on experience with data architecture in an Azure cloud environment.
Specific responsibilities include advising, supporting, and coaching project teams in ingesting data, creating data pipelines, selecting the appropriate Azure services, optimizing data storage, cataloging data, enforcing technical and architectural standards, and troubleshooting development and production issues. The Data Architect will also design and implement data security measures to protect PII/PHI data from unauthorized access, and incorporate data governance (policies, procedures, and standards for managing and using data) into the solution design.
The role requires continuously optimizing the performance of data pipelines in Databricks and Azure Data Factory (ADF), investigating and recommending new technologies to modernize the data pipeline process, and staying current on the latest advancements in data technologies. Collaborating with customer SMEs on data projects to develop data pipeline architectures and strategies is essential, as is mentoring project teams and data engineers on best practices and new technologies.
The Data Architect will actively lead and participate in the discovery, validation, and verification process throughout the development life cycle, engage in process improvement initiatives, and identify, evaluate, and demonstrate solutions to complex system problems. Additionally, the role involves designing and developing documentation including procedures, process flow diagrams, work instructions, and protocols for processes.

Responsibilities

  • Advise, support, and coach project teams in ingesting data and creating data pipelines.
  • Select appropriate Azure services and optimize data storage.
  • Catalog data and enforce technical & architectural standards.
  • Troubleshoot development & production issues.
  • Design and implement data security measures for PII/PHI data protection.
  • Incorporate data governance into solution design, including policies and procedures.
  • Continuously optimize performance of data pipelines in Databricks and Azure Data Factory.
  • Investigate and recommend new technologies to modernize data pipeline processes.
  • Collaborate with customer SMEs on data projects to develop data pipeline architectures.
  • Mentor project teams and data engineers on best practices and new technologies.
  • Collaborate with data engineers, business analysts, and testers to implement data architecture in an agile development team.
  • Lead/participate in the discovery/validation/verification process throughout the development life cycle.
  • Engage in process improvement initiatives.
  • Identify, evaluate, and demonstrate solutions to complex system problems.
  • Design and develop documentation including procedures, process flow diagrams, and work instructions.

Requirements

  • Bachelor's degree from an accredited college in a related discipline (or equivalent experience) and 8+ years of professional experience, or a related Master's degree and 6+ years of professional experience.
  • Proven data architecture experience on a large-scale Azure Data Lake platform.
  • Experience onboarding and managing multiple data pipelines of high complexity and processing millions of records per day.
  • Experience working simultaneously with multiple data sources and entities submitting data daily to the data lake.
  • Experience building Azure cloud-based ETL processes and data pipelines to automate data workflows.
  • Experience implementing automated processes to QC data products and pipelines before data release, including de-duplication of data.
  • Experience in handling and delivering big data analytics for daily users.
  • Strong prior experience with and expert knowledge of Databricks, Delta Lake, HDInsight, and Azure Data Factory.
  • Prior experience integrating applications with AI/ML technologies including chatbots.
  • Ability to collaborate with and influence customer leadership and external teams on data initiative strategies.
  • One or more relevant Microsoft Azure certifications.
  • Ability to present complex ideas and subject matter to stakeholders and customer leadership.
  • Proven experience working in a development environment following agile practices and processes.
  • Experience developing documentation including specifications, procedures, process flow diagrams, and protocols for processes.
  • Proven experience supporting highly critical customer missions.
  • Proven leadership experience.
  • Excellent verbal and written communication skills, including experience working directly with customers.

Nice-to-haves

  • Working experience at CDC or other federal agencies.
  • Experience with Azure DevOps and CI/CD pipelines.
  • Azure Databricks Platform Architect certification or similar certifications.
  • Working experience with Tableau, SAS Viya, R, and/or Python.
  • Experience with Transition-In to take over a large-scale Azure-based data lake platform.
  • Experience with agile development processes.