Annapurna Labs - Del Valle, TX

posted 3 months ago

Full-time
Del Valle, TX
Professional, Scientific, and Technical Services

About the position

Amazon Web Services (AWS) provides a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in 190 countries around the world. AWS has the broadest and deepest set of machine learning and AI services for our customers' businesses. Annapurna Labs, part of AWS, is seeking highly experienced Hardware Test Engineers, System Test Engineers, Manufacturing Test Engineers, and System Validation Engineers to enable high quality and efficient testing for the next generation of our cloud server platforms. Our success depends on our world-class infrastructure as we are handling massive scale and rapid integration of emergent technologies. As a member of the Machine Learning Acceleration team, you will be responsible for the enablement and improvement of our system level manufacturing environment. You will work on developing tests that ensure functionality and capability of our custom hardware used in the AWS server fleet. You will develop expertise in the top-to-bottom functionality of the entire system as well as the intended customer applications and stress the system from a customer perspective. You will work together with other engineering teams to develop, maintain, and improve manufacturing test code for new and existing products. You'll work with both high-level and low-level operating system constructs to create first-boot images for products in manufacturing. You will develop and maintain the deployment and distribution system to ensure that our manufacturing partners have access to appropriate versions of our software as soon as it's available. You will respond to new issues raised by our manufacturing partners, analyze logs and failures, and then develop and deploy solutions to those issues. You will develop documentation as well as testing and debug procedures for our manufacturing partners to follow.

Responsibilities

  • Enable and maintain mass volume production testing, working with our ODMs and JDMs to verify stable high-quality execution
  • Drive ODM and JDM deliveries to ensure production manufacturing quality
  • Identify and develop tests needed to enhance coverage and increase failure granularity
  • Debug test hardware and software used for system level and server level mass production
  • Develop manufacturing tests to exercise hardware components and collect data for large scale analysis

Requirements

  • Bachelor's degree in Electrical Engineering or Computer Engineering
  • 4+ years of experience developing embedded systems code and hardware interfaces (I2C, UART, SPI, JTAG, PCIe, etc.)
  • Experience with Python, BASH or other scripting language
  • Experience analyzing yield and bin pareto
  • Experience working with system management components (BMC, BIOS, CPLD, etc.)
  • Experience with debugging and root cause investigations using hardware schematics and tools such as logic analyzers
  • Strong background working in UNIX environments

Nice-to-haves

  • Experience with C/C++
  • Experience working with fully integrated software/hardware systems
  • Experience exercising server level, PCB level and SOC level components
  • Experience with embedded Linux device drivers and operating system
  • Experience interfacing with JDM or ODMs operations
  • Experience debugging high speed interfaces such as PCIe
  • Experience flashing hardware components in server chassis (BMC, BIOS, CPLD, etc.)
  • Experience working with U-BOOT environment
  • Ability to travel internationally up to 10% of the time
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service