Ally Financial - Charlotte, NC
posted 4 months ago
As a Senior Site Reliability Engineer (SRE) at Ally Financial, you will play a crucial role in ensuring the reliability and scalability of our complex systems. This position is designed for individuals who are passionate about implementing efficient solutions to prevent and resolve incidents. You will be part of a dynamic team that embodies a startup feel while benefiting from the stability of a well-established company. Your work will directly impact the user experience and system performance, making it essential to advocate for reliability best practices throughout the application development lifecycle. In this role, you will collaborate with cross-functional teams to design, build, and maintain robust, scalable, and fault-tolerant systems. You will work closely with development teams and architects to ensure that reliability is a priority from the outset of application development. Your responsibilities will include designing and implementing monitoring and alerting systems to provide real-time visibility into user experience and system health. You will also monitor and analyze system performance, proactively identifying potential issues and implementing solutions to ensure optimal performance and reliability. Additionally, you will develop and maintain automated tools and processes to streamline operational tasks, participate in incident response and post-mortems, and contribute to continuous improvement efforts. Conducting capacity planning and resource optimization will be key to handling the growing demands on our infrastructure. You will continuously research and evaluate new technologies and practices to enhance the reliability and efficiency of our systems, ensuring that Ally remains at the forefront of technological innovation.