Workday - Atlanta, GA
posted 2 months ago
As a Senior Site Reliability Engineer (SRE) at Workday, Inc., you will play a crucial role in ensuring the reliability and performance of our services across environments including production, sandbox, implementation, sales, training, and partner services. Your primary responsibilities will be developing, supporting, and enhancing utilities that automate manual tasks and streamline processes. You will also add server capacity on both bare-metal and private cloud infrastructure.

You will be responsible for upgrading, patching, executing, and monitoring the processes that keep the Workday Service operational. This includes creating and maintaining scripts, applying patches, and making configuration changes to our systems, either manually or through automation tools. A key part of the role is ensuring that we consistently meet our Service Level Agreements (SLAs). You will also identify, document, and follow up on issues encountered during all phases of service delivery.

Additionally, you will enhance monitoring, alerting, and tracing for internal services and, most importantly, for production services. Your contributions will be vital in maintaining the high standard of service reliability that our customers expect from Workday.