Wipro - O'Fallon, MO

posted 4 months ago

Full-time - Senior
O'Fallon, MO
10,001+ employees
Professional, Scientific, and Technical Services

About the position

Wipro Limited is seeking a highly skilled Databricks Architect to join our team in O'Fallon, Missouri. This role is pivotal in driving our clients' digital transformation initiatives by leveraging advanced data solutions. The ideal candidate will have over 10 years of experience in data architecture, specifically with a strong focus on Databricks and related technologies. The Databricks Architect will be responsible for designing and implementing complex data solutions that align with business objectives, ensuring data governance, integration, and security across various platforms. In this role, you will be expected to have hands-on experience with PySpark and Apache Spark, as well as a proven track record in migrating native Spark and Hadoop environments to Databricks. You will also be tasked with building robust orchestration layers using Databricks and Azure Data Factory (ADF), creating CI/CD pipelines for Databricks in Azure DevOps, and processing near real-time data through Auto Loader and DLT pipelines. Additionally, you will implement security layers in Delta Lake and develop cost-effective infrastructure solutions within Databricks. The successful candidate will demonstrate expertise in data modeling, integration, and governance, with a strong ability to extract logic from on-premises layers such as SSIS, stored procedures, and Informatica into PySpark. You will also need to be cloud-agnostic, capable of building solutions that are not limited to a single cloud provider. Excellent collaboration and communication skills are essential, as you will work closely with various stakeholders to establish best practices for business optimizations and ensure the successful deployment of data solutions.

Responsibilities

  • Design and implement complex data solutions aligned with business objectives.
  • Migrate native Spark and Hadoop environments to Databricks.
  • Build strong orchestration layers in Databricks and Azure Data Factory (ADF).
  • Create CI/CD pipelines for Databricks in Azure DevOps.
  • Process near real-time data through Auto Loader and DLT pipelines.
  • Implement security layers in Delta Lake.
  • Develop cost-effective infrastructure solutions in Databricks.
  • Extract logic from on-premises layers into PySpark.
  • Establish best practices for business optimizations.
  • Guide the definition of virtual data models and data virtualization architecture.

Requirements

  • 10+ years of experience in data architecture and engineering.
  • Strong hands-on experience with PySpark and Apache Spark.
  • Experience in migrating native Spark and Hadoop to Databricks.
  • Proven experience in building data governance solutions like Unity Catalog.
  • Expertise in data modeling, integration, security, and governance.
  • Hands-on experience with Azure, Databricks, and PySpark technologies.
  • Experience with relational and non-relational data stores (Hadoop, SQL, MongoDB).
  • Proficiency in ETL or ELT tools (SSIS, Informatica, Matillion, DBT).
  • In-depth experience with data governance and data integration technologies.
  • Excellent collaboration and communication skills.

Nice-to-haves

  • Knowledge of cloud-based data solutions (e.g., AWS, Azure).
  • Experience with data lake and data fabric concepts.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service