Digipulse Technologies - Cherry Hill, NJ

posted 5 days ago

Full-time
Remote - Cherry Hill, NJ
Professional, Scientific, and Technical Services

About the position

This position is responsible for designing, developing, and maintaining the scalable data pipelines, datasets, and systems that make up the company's data infrastructure. The role involves collaborating with stakeholders to understand data requirements and developing ETL processes that transform raw data into formats usable for analysis. The candidate will also design and implement data models to support business analytics and reporting needs while ensuring data integrity, consistency, and performance.

Responsibilities

  • Design and develop scalable data pipelines and systems for data infrastructure.
  • Collaborate with stakeholders to understand data requirements.
  • Develop ETL processes to transform raw data into usable formats.
  • Utilize techniques such as partitioning, indexing, and caching for cost-effective data management.
  • Design and implement data models to support business analytics and reporting needs.
  • Ensure data integrity, consistency, and performance across systems.
  • Identify errors and performance issues in data systems.
  • Perform data quality checks to verify the accuracy and consistency of data.
  • Document data pipelines, processes, and best practices for knowledge sharing.
  • Identify and resolve issues to ensure uninterrupted data flow and availability.

Requirements

  • Master's degree in Computer Science, Computer Applications, or Engineering (Mechanical, Information Technology, Civil, Electrical, Electronics).
  • Alternatively, a Bachelor's degree in a relevant field plus five years of progressive experience is accepted in lieu of a Master's degree.
  • Experience with at least three of the following tools: Oracle, SQL Server, UNIX, Python, AWS, Azure, Git, Apache (Spark, Kafka, Airflow), Google BigQuery, SQL, ETL, Informatica, DB2, Snowflake.