Innova Solutions USA - Fountain Valley, CA

posted about 2 months ago

Full-time - Mid Level
Fountain Valley, CA
10,001+ employees
Professional, Scientific, and Technical Services

About the position

Innova Solutions is seeking a highly skilled Sr. Data Engineer to join our team in Fountain Valley, California. This position is a full-time role that focuses on developing, implementing, and designing data pipelines and ETL processes to efficiently manage large volumes of data. The ideal candidate will have a strong background in PySpark and Big Data development within the Hadoop Ecosystem, with a minimum of 7 years of relevant experience in Big Data and a total of over 12 years of experience in the field. Proficiency in technologies such as Hadoop, Hive, Spark, Unix, SQL, and Python is essential for success in this role. As a Sr. Data Engineer, you will collaborate with cross-functional teams to understand data requirements and create scalable solutions for data storage, processing, and retrieval. You will be responsible for fine-tuning and optimizing data processes to ensure exceptional performance, reliability, and data integrity. Keeping up with the latest industry best practices and emerging technologies in data engineering is crucial, as is addressing and troubleshooting issues related to data pipelines and processing. Additionally, you will participate actively in code reviews, providing constructive feedback to enhance code quality, and offer mentorship to junior team members, fostering a collaborative and innovative work environment. This contract position is expected to run until the end of the year, with the possibility of extension based on project needs.

Responsibilities

  • Develop, implement, and design data pipelines and ETL processes to efficiently ingest, transform, and load large volumes of data.
  • Collaborate with cross-functional teams to gain insights into data requirements and devise scalable solutions for data storage, processing, and retrieval.
  • Fine-tune and optimize data processes to ensure exceptional performance, reliability, and data integrity.
  • Utilize PySpark, Spark, and Hadoop to build robust data solutions.
  • Keep abreast of the latest industry best practices and emerging technologies in data engineering.
  • Address and troubleshoot issues related to data pipelines and processing.
  • Participate actively in code reviews and offer constructive feedback to enhance code quality.
  • Provide mentorship to junior team members and actively foster a collaborative and innovative work environment.

Requirements

  • Minimum of 7 years of relevant experience in Big Data development.
  • Total of 12+ years of experience in data engineering or related fields.
  • Proficiency in Hadoop, Hive, Spark, Unix, SQL, and Python.
  • Strong background in PySpark and Big Data development on the Hadoop Ecosystem.

Benefits

  • Medical & pharmacy coverage
  • Dental/vision insurance
  • 401(k)
  • Health saving account (HSA)
  • Flexible spending account (FSA)
  • Life Insurance
  • Pet Insurance
  • Short term and Long term Disability
  • Accident & Critical illness coverage
  • Pre-paid legal & ID theft protection
  • Sick time
  • Employee Assistance Program (EAP)
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service