Unclassified - Woonsocket, RI
posted 5 months ago
The GCP Data Engineer position is a critical role within a leading global IT services and consulting company that specializes in providing a wide range of services across various industries, including banking, financial services, retail, manufacturing, and healthcare. The company is recognized for its commitment to innovation and invests heavily in research and development to maintain its competitive edge in the technology sector. This role requires a deep understanding of Google Cloud architecture and a strong background in database and data warehouse technologies, particularly those offered by Google, such as Big Query, Cloud SQL, Spanner, and Big Table. As a GCP Data Engineer, you will be responsible for building and maintaining the infrastructure necessary for optimal extraction, transformation, and loading (ETL) of data from diverse data sources. This includes utilizing SQL and Google Cloud's big data technologies to create efficient data pipelines using tools like Data Proc Cluster, Data Flow, Pub/Sub, Cloud Composer (Airflow), and Cloud Functions. The position demands fluency in object-oriented programming languages, with a preference for Python, and a solid understanding of distributed systems architecture. Additionally, the role involves implementing data quality processes, which encompass data cleansing, audits, alerts, and triage mechanisms to ensure referential integrity. Familiarity with CI/CD practices, including release and deployment using Google Cloud Build, Git flow, GKE, and Docker, is also essential. This position is designed for individuals who are passionate about data engineering and are eager to contribute to the modernization of data warehousing and cloud-based data lakes.