Ally - Raleigh, NC
posted 4 months ago
Ally Financial is seeking a talented and motivated Site Reliability Engineer (SRE) to join our dynamic team. This role is crucial for ensuring the reliability and scalability of complex systems, and the ideal candidate will thrive on implementing efficient solutions to prevent and resolve incidents.

As an SRE, you will be responsible for managing the SRE Team, which includes both Ally employees and contractors. You will collaborate with cross-functional teams to design, build, and maintain robust, scalable, and fault-tolerant systems. Your work will involve close collaboration with development teams and architects to advocate for reliability best practices throughout the application development lifecycle.

In this position, you will design and implement monitoring and alerting systems to provide real-time visibility into user experience and system health. You will monitor and analyze system performance, proactively identifying potential issues and implementing solutions to ensure optimal performance and reliability. Additionally, you will develop and maintain automated tools and processes to streamline operational tasks and reduce manual interventions.

Participation in incident response and post-mortems will be part of your responsibilities, contributing to continuous improvement efforts. You will also conduct capacity planning and resource optimization to handle growing demands on our infrastructure, continuously researching and evaluating new technologies and practices to enhance the reliability and efficiency of our systems.

At Ally, we pride ourselves on fostering a culture that values diverse thinking and supports one another. We are relentless in finding new ways technology can help make experiences better and help people. If you are passionate about technology and want to make a real impact, this is the opportunity for you.