Annapurna Labs - Austin, TX

posted 4 months ago

Full-time
Austin, TX
Professional, Scientific, and Technical Services

About the position

Amazon Web Services (AWS) is seeking highly experienced Hardware Test Engineers, System Test Engineers, Manufacturing Test Engineers, and System Validation Engineers to join the Machine Learning Acceleration team. This team is responsible for enabling high quality and efficient testing for the next generation of our cloud server platforms. As a member of this team, you will play a crucial role in ensuring the functionality and capability of our custom hardware used in the AWS server fleet. Your work will involve developing tests that not only verify the performance of the hardware but also stress the system from a customer perspective, ensuring that it meets the high standards expected by our users. In this role, you will develop expertise in the top-to-bottom functionality of the entire system, collaborating with other engineering teams to maintain and improve manufacturing test code for both new and existing products. You will be responsible for creating first-boot images for products in manufacturing, working with both high-level and low-level operating system constructs. Additionally, you will develop and maintain the deployment and distribution system to ensure that our manufacturing partners have access to the latest software versions as soon as they are available. Your responsibilities will also include responding to new issues raised by our manufacturing partners, analyzing logs and failures, and developing solutions to address these issues. You will create documentation and testing procedures for our manufacturing partners to follow, ensuring that they have the necessary resources to maintain production quality. This position is critical to the success of our world-class infrastructure, as we handle massive scale and rapid integration of emergent technologies.

Responsibilities

  • Enable and maintain mass volume production testing, working with our ODMs and JDMs to verify stable high-quality execution
  • Drive ODM and JDM deliveries to ensure production manufacturing quality
  • Identify and develop tests needed to enhance coverage and increase failure granularity
  • Debug test hardware and software used for system level and server level mass production
  • Develop manufacturing tests to exercise hardware components and collect data for large scale analysis

Requirements

  • Bachelor's degree in Electrical Engineering or Computer Engineering
  • 4+ years of experience developing embedded systems code and hardware interfaces (I2C, UART, SPI, JTAG, PCIe, etc.)
  • Experience with Python, BASH or other scripting language
  • Experience analyzing yield and bin pareto
  • Experience working with system management components (BMC, BIOS, CPLD, etc.)
  • Experience with debugging and root cause investigations using hardware schematics and tools such as logic analyzers
  • Strong background working in UNIX environments

Nice-to-haves

  • Experience with C/C++
  • Experience working with fully integrated software/hardware systems
  • Experience exercising server level, PCB level and SOC level components
  • Experience with embedded Linux device drivers and operating system
  • Experience interfacing with JDM or ODMs operations
  • Experience debugging high speed interfaces such as PCIe
  • Experience flashing hardware components in server chassis (BMC, BIOS, CPLD, etc.)
  • Experience working with U-BOOT environment
  • Ability to travel internationally up to 10% of the time
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service