Annapurna Labs - Pflugerville, TX
posted 3 months ago
Amazon Web Services (AWS) is seeking highly experienced Hardware Test Engineers, System Test Engineers, Manufacturing Test Engineers, and System Validation Engineers to join the Machine Learning Acceleration team. The team is responsible for enabling high-quality, efficient testing of the next generation of cloud server platforms.

As a member of this team, you will develop tests that verify the functionality and capability of the custom hardware used in the AWS server fleet. This requires building expertise in how the entire system functions and understanding the intended customer applications well enough to stress the system from a customer perspective.

In this role, you will collaborate with other engineering teams to develop, maintain, and improve manufacturing test code for both new and existing products. You will work with high-level and low-level operating system constructs to create first-boot images for products in manufacturing. You will develop and maintain the deployment and distribution system that gives manufacturing partners access to the appropriate software versions as soon as they are available. You will also respond to new issues raised by manufacturing partners, analyze logs and failures, and develop and deploy solutions to those issues. Finally, you will write documentation and testing and debug procedures for manufacturing partners to follow.
Key responsibilities include: enabling and maintaining high-volume production testing; working with Original Design Manufacturers (ODMs) and Joint Design Manufacturers (JDMs) to verify stable, high-quality execution; driving ODM and JDM deliveries to ensure production manufacturing quality; identifying and developing tests that enhance coverage and increase failure granularity; debugging the test hardware and software used for system-level and server-level mass production; and developing manufacturing tests that exercise hardware components and collect data for large-scale analysis.