Amazon - North Reading, MA

posted 30 days ago

Full-time - Mid Level
North Reading, MA
Sporting Goods, Hobby, Musical Instrument, Book, and Miscellaneous Retailers

About the position

The Data Engineer position at Amazon Fulfillment Technologies and Robotics focuses on developing and maintaining data pipelines to support the training of state-of-the-art Foundation Models for multi-robot systems. This role is crucial for merging data from diverse sources to enhance the capabilities of Amazon's fleet of over 750,000 mobile robots, ultimately improving automation and robotics solutions at scale.

Responsibilities

  • Work with cross-functional teams to gather and analyze the data requirements for building state-of-the-art AI models.
  • Design, develop, and maintain data pipelines to collect, clean, and store data from multiple diverse sources.
  • Implement data quality and validation mechanisms to ensure data and model integrity.
  • Work closely with Science teams to assist the downstream use cases of their models.
  • Optimize data processing, storage, and retrieval solutions for scalability, cost, and performance tradeoffs.
  • Feedback data issues and opportunities to various teams and support the improvement of data collection practices and processes.

Requirements

  • 3+ years of data engineering experience
  • 3+ years of analyzing and interpreting data with Redshift, Oracle, NoSQL etc. experience
  • Knowledge of distributed systems as it pertains to data storage and computing
  • Experience with data modeling, warehousing and building ETL pipelines
  • Experience working on and delivering end to end projects independently
  • Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby
  • Experience with Redshift, Oracle, NoSQL etc.
  • Master's degree in computer science, engineering, analytics, mathematics, statistics, IT or equivalent
  • Familiarity and comfort with Python, SQL, Docker, and Shell scripting. Java preferred but not necessary.

Nice-to-haves

  • Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, FireHose, Lambda, and IAM roles and permissions
  • Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
  • Experience as a data engineer or related specialty (e.g., software engineer, business intelligence engineer, data scientist) with a track record of manipulating, processing, and extracting value from large datasets
  • Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
  • Experience with Apache Spark / Elastic Map Reduce
  • Experience with continuous delivery, infrastructure as code, microservices, in addition to designing and implementing automated data solutions using Apache Airflow, AWS StepFunctions, or equivalent.

Benefits

  • Medical, Dental, and Vision Coverage
  • Maternity and Parental Leave Options
  • Paid Time Off (PTO)
  • 401(k) Plan
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service