W3Global - El Segundo, CA

posted 10 days ago

Full-time
El Segundo, CA
Professional, Scientific, and Technical Services

About the position

The Data Architect role focuses on designing and developing data solutions using Google Cloud Platform and its associated tools. The position requires a strong background in data engineering, particularly with Google BigQuery and related data products, to build efficient data pipelines and manage data integration and transformation processes. The role emphasizes hands-on experience with big data technologies and the ability to automate data loads and manage the end-to-end data lifecycle.

Responsibilities

  • Design and develop ETL/ELT frameworks using BigQuery.
  • Build and manage data pipelines using Google Cloud Platform.
  • Automate data loading from BigQuery using APIs or scripting languages.
  • Implement data integration, transformation, and quality processes.
  • Create and manage Airflow DAGs for scheduling and configuration.
  • Utilize big data technologies such as Spark, Hadoop, and Hive.
  • Develop data lineage using dbt on Google Cloud Platform.
  • Conduct end-to-end solution design, including prototyping and usability testing.

Requirements

  • Professional experience in Data Engineering with Google BigQuery and Google Cloud Platform.
  • Hands-on experience with Google Data Products (BigQuery, Dataflow, Dataproc, Dataprep, Cloud Composer, Airflow).
  • Expertise in Python programming, including PySpark and Pandas.
  • Experience with big data technologies and solutions (Spark, Hadoop, Hive, MapReduce).
  • Proficiency with modern SQL and NoSQL data stores.
  • Experience in DevSecOps (CI/CD) environments.

Nice-to-haves

  • Experience using dbt to create data lineage.