Spirit Airlines - Dania Beach, FL
posted 4 months ago
The Senior Admin Site Reliability Engineer (SRE) plays a critical role in administering, supporting, and analyzing systems within our environments. This position is pivotal in delivering real-time insights from massive-scale data and in collaborating with cross-functional teams to develop innovative solutions and improve the user experience. We are seeking an individual who brings fresh perspectives, demonstrates exceptional technical proficiency, and is dedicated to continuously improving our systems and processes. The ideal candidate will have experience monitoring critical applications and coordinating with infrastructure and development teams to streamline and optimize the performance of those applications, along with knowledge of a wide range of technologies, automation tools, operating systems, and networking concepts, and a desire to keep building on that knowledge.
In this role, you will oversee the production environment by monitoring availability and maintaining a holistic view of system health. You will lead efforts to improve reliability, quality, and time-to-market for our suite of software solutions. Additionally, you will spearhead initiatives to measure and optimize system performance, driving innovation and staying ahead of customer needs.
Your responsibilities will include providing primary operational support and engineering for multiple large distributed software applications, gathering and analyzing metrics from both operating systems and applications to assist in performance tuning and fault finding, and evaluating incidents after resolution. You will also be responsible for creating sustainable systems and services through automation and uplifts, understanding the stages of software development, and documenting processes.
Your duties will also include evaluating existing applications and platforms to recommend system enhancements, performing daily system monitoring, verifying the integrity and availability of all systems, and reviewing system and application logs. You will define, prioritize, and resolve all support requests in an organized, efficient, and timely fashion, and you will develop, maintain, and use automation scripts in languages such as JavaScript, PowerShell, and Python. The role also involves exploring and implementing new ways to automate systems, designing and testing automation equipment and processes, and maintaining data quality and integrity by standardizing data definitions.