Wipro - Pittsburgh, PA
posted 3 months ago
Wipro Limited is seeking a Python PySpark Developer to join our team in Pittsburgh, Pennsylvania. The ideal candidate will have a strong background in data integration and pipeline development, with at least 4 years of relevant experience. This role requires expertise in AWS Cloud technologies, particularly in integrating data using Apache Spark, EMR, Glue, Kafka, Kinesis, and Lambda within S3, Redshift, RDS, and MongoDB/DynamoDB ecosystems. The successful candidate will have a proven track record in Python development, especially with PySpark in an AWS Cloud environment. In this position, you will be responsible for designing, developing, testing, deploying, maintaining, and improving data integration pipelines. You will leverage your strong analytical skills to write complex queries, optimize them, and debug issues as they arise. Familiarity with source control systems such as Git, Bitbucket, and Jenkins for build and continuous integration is essential. Experience with Databricks or Apache Spark is considered a plus. Your responsibilities will include innovating data integration solutions on our Apache Spark-based platform, ensuring that technology solutions utilize cutting-edge integration capabilities. You will facilitate requirements gathering and process mapping workshops, review business and functional requirement documents, and author technical design documents, testing plans, and scripts. Additionally, you will assist in implementing standard operating procedures and facilitate review sessions with functional owners and end-user representatives, using your technical knowledge to drive improvements.