American Express - Sunrise, FL
posted 2 months ago
As a Senior Engineer in Site Reliability at American Express, you will play a pivotal role in ensuring the reliability and performance of our software systems. This position is part of a global Site Reliability Engineering (SRE) organization, where you will collaborate with Core Engineering and Platform Teams to align engineering efforts with strategic goals. Your work will involve managing complex challenges unique to American Express, leveraging your expertise in coding, algorithms, complexity analysis, and large-scale system design. You will be part of a diverse tech team that values collaboration, intellectual curiosity, and problem-solving, fostering an environment where your contributions are recognized and valued. In this role, you will be responsible for the technical aspects of software reliability for assigned applications, which includes designing, developing prototypes, and coding assignments. You will function as a leader on an agile team, contributing to software builds through consistent development practices, participating in architectural decisions, and leading code reviews and automated testing. Your responsibilities will also include debugging software components, consulting with teams to build standards for high availability, and implementing orchestration and automation solutions to enhance accuracy and reduce defects. You will drive monitoring requirements to ensure business-service level visibility and provide mentorship to software engineers on design patterns that resist failure. Additionally, you will introduce new technologies to the production support toolchain, helping to minimize friction during production releases and improving incident recovery processes. Your role will also involve facilitating the resolution of non-application issues, ensuring operational readiness throughout the application lifecycle, and being an efficiency multiplier for your team by analyzing workflows and driving productivity.