Amazon.composted about 1 month ago
$129,300 - $223,600/Yr
Full-time • Mid Level
Redmond, WA
General Merchandise Retailers

About the position

Amazon Web Services Open Data Analytics (ODA) organization is looking for exceptional engineers to help in our mission to provide the world's best cloud Big Data processing platform and services such as EMR and Athena. The ODA engines team is looking for an experienced engineer to join the core engines and datalake team. Athena and EMR are services that our customer use to run large scale analytics, leveraging open source engines like Apache Spark and Trino, with datalake open table formats like Apache Iceberg, Hudi and Delta. The analytics engines organization makes significant modifications to these engines to run in serverless environments and with superior performance and scalability than what is available in Open Source. In the last 3 years we have improved our engines by a factor of 5x by making changes to the optimizer, query runtime and storage connectors. We have also made significant changes to the compiler to enable enterprise features like fine grain access control with these engines and table formats. Additionally, we strive to regularly contribute features, bug fixes and optimizations back to open-source, as well be current with the latest open-source versions of these frameworks. This is a "must-win" strategic area in a growing and very technical space. We are seeking a passionate and hands-on engineer to collaborate closely with open-source communities like Apache Iceberg and Apache Spark, driving innovations in query engines and table format integrations. In this role, you will focus on performance optimizations, feature enhancements, stability improvements, and security hardening, making deep contributions across the query engine and table format codebases. As a key member of the Engines team, you will shape the technical direction, influence design decisions, implement critical features, and foster collaboration with both internal teams and the open-source community.

Responsibilities

  • Develop and optimize core components of query engines and open table formats (Iceberg, Hudi, Delta) to enhance performance, scalability, and reliability.
  • Design and implement innovative solutions and algorithms to improve feature capabilities, stability, and security in table format integrations with query engines.
  • Collaborate with the open-source community, contributing to discussions, driving improvements, and integrating upstream changes.
  • Ensure data consistency and durability while achieving breakthrough performance and scalability for large-scale data lake workloads.
  • Improve the organizations automation and testing capabilities.
  • Manage complex deliverables project and research projects with deadlines.
  • Mentor and train other team members on design techniques and coding best practices.
  • Be a point of contact for challenging customer issues related to data lake workloads and query engine.

Requirements

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • 3+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience

Nice-to-haves

  • 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
  • Bachelor's degree in computer science or equivalent
  • Experience in developing and operating distributed systems or applications at large scale
  • Experience working on open table formats (Iceberg, Hudi, Delta) or query engines (Spark, Trino, Flink etc) is a huge plus
  • Experience contributing to open source code bases, and collaborating with open source communities

Benefits

  • 401k
  • health_insurance
  • dental_insurance
  • vision_insurance
  • life_insurance
  • disability_insurance
  • paid_holidays
  • paid_volunteer_time
  • tuition_reimbursement
  • employee_stock_purchase_plan
  • performance_bonus
  • sign_on_bonus

Job Keywords

Hard Skills
  • Apache Iceberg
  • Apache Spark
  • Code Review
  • Test Automation
  • Web Services
  • 6jfrhtzo dnPmZRibOCyg
  • BkAT6w9 UkVgy8
  • EShV6k OeEQdHiwYPKnu
  • fr9XomnOp fqBvnTYZoOGU
  • iGMY5SR1DUQxt dznBwG8 oLMgNTEfkV
  • jcFDfRIPh pDa7fMJb
  • mH3Dj qo6SutACbUwg
  • Pm30TBCUnpc 8Tsdc21
  • QEblU 6N3zcnLdsVu
  • qGPTlzukF E9jb32aVNvR4
  • Qnosg6rv7Ktf 83vmfiHt2I
  • trq wkaUeXFpOHEC9 ijF1ba6
  • uNxtkb9Eyp8F3 zg9oEe0dA7SD
  • XvQ6d uAslBe1IvX4
  • zEMl5LjDZ RXkt67me
Build your resume with AI

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service