NTT DATA - Frisco, TX
posted about 2 months ago
NTT DATA is seeking a QA DATA Engineer to join our team remotely, based in Frisco, Texas. This role is pivotal in ensuring the quality and reliability of our data pipelines within an agile development environment. The successful candidate will be responsible for designing, developing, and maintaining comprehensive test plans and test cases tailored for scalable data pipelines. This includes implementing automated testing solutions using Python and PySpark on a cloud-native Lakehouse data platform, as well as writing efficient SQL queries to validate data extraction, transformation, and loading processes. Collaboration is key in this role, as you will work closely with product management and analysts to understand data requirements and ensure quality assurance throughout the development lifecycle. You will also be tasked with optimizing and troubleshooting data pipelines to enhance performance and reliability, ensuring data quality and integrity through rigorous testing and validation processes. Following DevOps principles, you will utilize CI/CD practices to deploy and operate automated testing frameworks, contributing to a culture of continuous improvement and innovation. The ideal candidate will possess a strong background in quality assurance, with a focus on data engineering. You will need to demonstrate proficiency in Python and PySpark, along with a solid understanding of SQL and database management. Familiarity with software development patterns and best practices in quality assurance is essential, as is experience with ETL/ELT processes and data pipeline testing. Your ability to develop using version control and automated testing tools, particularly git-based tools like GitHub and GitHub Actions, will be crucial to your success in this role.