InfoVision - Irving, TX
posted about 2 months ago
We are seeking a skilled Data Engineer with extensive experience in big data processing technologies and data architecture. The ideal candidate will have a strong background in working with various big data platforms such as Cloudera, Horton Works, Snowflake, AWS EMR, RedShift, and AWS Glue. You will be responsible for designing and implementing data pipelines, ensuring the efficient processing and storage of large datasets, and optimizing data workflows to support analytics and reporting needs. In this role, you will work closely with cross-functional teams to understand data requirements and translate them into technical specifications. You will leverage your expertise in Hadoop, Apache Spark, Pyspark, and other big data technologies to build robust data solutions. Your responsibilities will also include maintaining data warehouse technical architecture and infrastructure components, as well as utilizing reporting and analytic tools to derive insights from data. The successful candidate will have experience in building automated ETL processes and data pipelines, ensuring data quality and integrity throughout the data lifecycle. You will also be expected to utilize scripting languages such as Python, Scala, or Shell Scripting to automate tasks and improve efficiency. Familiarity with build tools, version control, unit testing, monitoring, and change management practices to support DevOps is essential for this position.