Yahoo Holdings - Baltimore, MD

posted 2 months ago

Full-time - Senior
Hybrid - Baltimore, MD

About the position

The Senior Software Development Engineer at Yahoo analyzes, designs, programs, debugs, and modifies software enhancements and new products. The position involves leading the development of data warehouse designs in collaboration with a team of Big Data engineers. The engineer will work in an agile, Scrum-driven environment, focusing on delivering innovative products that meet the needs of the business. Responsibilities include designing applications, writing code, developing and testing software, debugging, and documenting work and results. Staying up to date with relevant technology is essential to maintain and improve the functionality of the applications developed.

In this role, the engineer will design and implement reusable frameworks, libraries, Java components, and product features in collaboration with business and IT stakeholders. The position requires ingesting data from various structured and unstructured data sources into Hadoop and other distributed Big Data systems. The engineer will support the sustainment and delivery of an automated ETL pipeline, validate data extracted from sources like HDFS, databases, and other repositories, and enrich and transform extracted data as required. Monitoring and reporting the data flow through the ETL process is also a key responsibility.

The Senior Software Development Engineer will perform data extractions, data purges, or data fixes in accordance with internal procedures and policies. Development and operational support are tracked via user stories and decomposed technical tasks, using tools including Git, Maven, and JIRA. The engineer will troubleshoot production support issues post-deployment and provide solutions as required. Mentoring junior engineers within the team is also an important part of the role.

Responsibilities

  • Design and implement reusable frameworks, libraries, Java components, and product features in collaboration with business and IT stakeholders.
  • Ingest data from various structured and unstructured data sources into Hadoop and other distributed Big Data systems.
  • Support the sustainment and delivery of an automated ETL pipeline.
  • Validate data that is extracted from sources like HDFS, databases, and other repositories using scripts and other automated capabilities, logs, and queries.
  • Enrich and transform extracted data, as required.
  • Monitor and report the data flow through the ETL process.
  • Perform data extractions, data purges, or data fixes in accordance with current internal procedures and policies.
  • Track development and operational support via user stories and decomposed technical tasks, using the provided tooling, including Git, Maven, and JIRA.
  • Troubleshoot production support issues post-deployment and provide solutions as required.
  • Mentor junior engineers within the team to support their development.

Requirements

  • B.S. or M.S. in Computer Science (or equivalent experience).
  • Five years of related industry experience.
  • Experience in back-end programming (e.g., Java, JavaScript, Python, Node.js), object-oriented analysis and design (OOAD), and ETL tools.
  • Experience with at least one database technology (e.g., Vertica, Oracle, Netezza, MySQL, BigQuery).
  • Experience working with large-scale databases.
  • Knowledge of and experience with Unix (Linux) platforms and shell scripting.
  • Experience writing Pig Latin scripts, MapReduce jobs, HiveQL queries, etc.
  • Good knowledge of database structures, theories, principles, and practices.
  • Familiarity with data loading tools such as Flume and Sqoop.
  • Knowledge of workflow schedulers such as Oozie and Airflow.
  • Analytical and problem-solving skills applied to the Big Data domain.
  • Proven understanding of Hadoop (Dataproc), HBase, Hive, and Pig.
  • Knowledge of cloud providers such as AWS, GCP, and Azure.
  • Ability to write high-performance, reliable, and maintainable code.
  • Expertise in version control tools such as Git.
  • Good grasp of multithreading and concurrency concepts.
  • Effective analytical, troubleshooting, and problem-solving skills.
  • Strong customer focus, ownership, urgency, and drive.

Benefits

  • Healthcare
  • 401K savings plan
  • Company holidays
  • Vacation
  • Sick time
  • Parental leave
  • Employee assistance program