Weill Cornell Medicine - New York, NY
posted 2 months ago
The Service Operations Analyst II - Infrastructure position at Weill Cornell Medicine is a senior role within the IT Operations department, focusing on providing technical leadership in various infrastructure domains. This includes expertise in Cloud Operations (AWS, GCP, Azure), Backup Configuration, File Share Management, BigFix Administration, and On-Premises Infrastructure management, which encompasses VMware, Windows, Linux Server Management, and DNS. The analyst will play a crucial role in identifying incidents and analyzing problem trends, overseeing the management and resolution of issues, and contributing to root cause analysis and troubleshooting of discovered issues within the Operations Center-supported services. In this role, the analyst will be responsible for monitoring and troubleshooting processes, system triage, and recovery for all infrastructure, applications, and data center environments. They will identify operational risks and propose alternative solutions while participating in the technical escalation of IT issues. Collaboration with both application and operational teams is essential, as the analyst will engage in systems analysis, diagnosis, troubleshooting, performance analysis, and resolution of issues. The position also involves driving problem analysis and incident trending improvement opportunities, working closely with Service Owners and Operational Management to implement continual improvement initiatives. The analyst will document and represent operational requirements in service forums, manage critical incidents, and ensure operational readiness during service transitions. They will serve as an escalation point for junior analysts, providing training and guidance, and will also back up junior analysts in responding to tickets and monitoring event consoles. The role requires administering servers, storage, and backup technologies, assisting with data acquisitions and forensic investigations, and providing ongoing support for production and test/development systems. The analyst will also manage monitoring tools and participate in an on-call rotation to ensure 24x7x365 coverage of mission-critical systems and networks. Compliance with operational level agreements for service requests and adherence to ITIL processes for incident, request, change, and event management are critical aspects of this position.