Photon - Louisville, KY

posted 9 days ago

Full-time - Mid Level
Louisville, KY
Professional, Scientific, and Technical Services

About the position

We are seeking a skilled Data Analyst to join our Data Lake Discovery Program. The ideal candidate will have experience in working with large-scale data systems, performing data analysis, and supporting the design and implementation of data lakes. You will be responsible for analyzing, modeling, and transforming data, ensuring data quality, and supporting key stakeholders in leveraging the data lake for business insights.

Responsibilities

  • Collaborate with cross-functional teams to understand business requirements and define data collection, transformation, and analysis processes within the data lake.
  • Perform detailed data analysis, including cleansing, transformation, aggregation, and validation, across structured and unstructured datasets.
  • Assist in the design and development of data ingestion pipelines for batch and real-time data into the data lake.
  • Work with architects and engineers to ensure proper schema design and adherence to data lake best practices.
  • Ensure data integrity, accuracy, and governance through data quality monitoring and troubleshooting.
  • Collaborate with data engineers to optimize data storage, retrieval, and performance across the data lake.
  • Conduct exploratory data analysis (EDA) to discover patterns, insights, and trends, and provide recommendations to business stakeholders.
  • Provide support for API and batch message integration within the data lake, assisting with testing and validation of data flows.
  • Stay updated with emerging trends and technologies in data lake architectures, data governance, and analytics.

Requirements

  • 3+ years of experience as a Data Analyst, with strong proficiency in data analysis, data management, and reporting.
  • Experience working with data lakes, cloud platforms (Azure, AWS, GCP), and large-scale data systems.
  • Proven experience in data modeling, ETL processes, and working with structured and unstructured data.
  • Experience with data visualization tools (e.g., Tableau, Power BI).
  • Familiarity with cloud-based data lakes and tools like Databricks, Delta Lake, Apache Spark, or Hadoop.
  • Knowledge of data governance principles, data quality management, and schema design best practices.
  • Strong analytical and problem-solving skills with attention to detail.
  • Excellent communication and interpersonal skills, with the ability to translate complex data insights for non-technical stakeholders.
  • Ability to work independently as well as in a collaborative team environment.

Nice-to-haves

  • Experience with data lake architecture and data governance frameworks.
  • Familiarity with API and message design for data ingestion and integration.
  • Knowledge of regulatory compliance requirements (e.g., CCPA) in data handling and privacy.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service