Dynatrace - Detroit, MI
posted 3 months ago
As a Site Reliability Engineer at Dynatrace, you will work in a dynamic and secure cloud environment, leveraging your expertise across multiple IT disciplines and technologies. Your primary responsibility will be to maintain the service infrastructure, ensuring high availability, performance, and an optimal customer experience. You will autonomously deploy and update systems, services, and supporting infrastructure, streamlining processes to enhance efficiency. Security and compliance are paramount in this highly regulated environment, and you will focus on implementing fully automated processes to reduce manual work and increase productivity. Collaboration is key, as you will work alongside motivated engineers from diverse backgrounds in software engineering, system engineering, and product management, all within a supportive work environment that encourages growth and work-life balance. In your role, you will develop and implement automation solutions aimed at improving operational efficiency and minimizing manual tasks. You will plan and manage system capacity to optimize resource usage and cost efficiency, while coordinating and overseeing the release and deployment of products. Continuous improvement of existing processes will be a focus, as will automating monitoring and alerting to enhance the efficiency, security, and reliability of the cloud infrastructure. Additionally, you will investigate and resolve production incidents, providing support to ensure customer success, and utilize the Dynatrace platform to monitor and optimize system performance and user experience.