Smx Corporation Limited - Washington, DC

posted about 2 months ago

Full-time - Entry Level
Remote - Washington, DC
Administrative and Support Services

About the position

The Data Engineer Python role at SMX involves designing and implementing efficient data pipelines using Python and Apache Airflow. The position focuses on ensuring high data quality and continuous improvement of data management practices, supporting a remote team based in Washington, DC.

Responsibilities

  • Design, develop, and maintain ETL processes using Python and Apache Airflow.
  • Collaborate with data analysts and other stakeholders to understand and meet their data requirements.
  • Develop and implement data validation processes to ensure high data quality.
  • Troubleshoot and resolve issues related to data pipelines.
  • Optimize data extraction, transformation, and loading processes to improve efficiency and performance.
  • Document and maintain the design and details of data processes and schemas.
  • Stay updated with the latest industry trends and technologies to ensure our data practices remain current.

Requirements

  • Proficiency in Python, including knowledge of libraries like Pandas, NumPy, and Django.
  • Expertise in Apache Airflow, including knowledge of DAGs and Operators.
  • Proficiency in ETL processes, including data extraction, transformation, and loading.
  • Strong understanding of SQL and NoSQL databases, including writing complex queries.
  • Experience with data warehousing solutions like Amazon Redshift, Google BigQuery, or Microsoft Azure SQL Data Warehouse.
  • Strong communication and collaboration skills.
  • Excellent problem-solving skills.

Nice-to-haves

  • Knowledge of data modeling and data warehousing.

Benefits

  • Health insurance
  • Paid leave
  • Retirement
Job Description Matching

Match and compare your resume to any job description

Start Matching
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service