Vdart - Dallas, TX
Posted 3 months ago
As a Python/PySpark Developer, you will design, develop, and deploy scalable data processing pipelines built with Python and PySpark. This position is fully remote, so you can work from anywhere. You will collaborate closely with data scientists and analysts to gather requirements and translate them into technical solutions. You will write efficient, optimized, and secure code to process, transform, and analyze large volumes of data.

In this role, you will implement data ingestion from various data sources into the data processing platform, and you will create and maintain the data pipelines and workflows that support data processing and analytics. You will perform data quality checks to ensure data integrity throughout the system, and you will troubleshoot and debug production issues as they arise.

You will also work with cross-functional teams to ensure that data processing applications integrate seamlessly with other systems. Your contributions will directly affect the efficiency and effectiveness of data-driven decision-making within the organization.