Chewy - Richardson, TX
posted 3 months ago
As a Site Reliability Engineer II at Chewy, you will play a crucial role in enhancing site reliability and resiliency, managing system operations, and implementing infrastructure as code. This position is based in Richardson, Texas, and involves leveraging AWS services and containerization techniques to ensure a seamless transition of applications to production. You will be responsible for supporting the implementation and management of Chewy platform standards, which are essential for maintaining high availability and performance of our services. Your primary focus will be on creating a comprehensive framework for automating and optimizing processes, thereby reducing the need for manual intervention. You will utilize tools such as Python and Terraform to achieve efficient process automation and establish a robust framework for site reliability that can be measured and reported to our customers. Additionally, you will implement scalable processes using various automation tools and take charge of maintaining security hardening on the Load Balancer end, overseeing regular upgrades and software maintenance. In this role, you will also engage in daily operations and regular developer/admin activities on the Chewy platform, sharing reports across the organization to ensure transparency and accountability. Your contributions will be vital in ensuring that our systems are reliable, secure, and performant, ultimately enhancing the customer experience.