Endava - Berkeley Heights, NJ
posted 3 months ago
As a Digital Apps Site Reliability Engineer (SRE) at GalaxE Solutions, you will play a crucial role in providing hands-on support for existing environments. Your responsibilities will span software installation, patching, upgrades, query writing, configuration, security, system monitoring and tuning, disaster recovery planning, and release deployments.
You will provide 24x7 support for production Internet applications on a rotating basis and act as a point of escalation for application support, diagnosing and resolving complex customer issues in the Portal and Web Services environments. You will drive technical and management bridges during incidents as required, drawing on your experience and organizational knowledge to reduce Mean Time to Recovery (MTTR).
You will work with Change Management and Release Managers to review proposed production changes, and you will participate in all Production Support activities during incidents and outages. As a hands-on technical resource, you will resolve technical issues across lower and upper environments and recommend performance and capacity improvements.
Documentation is a key part of this role: you will document install defects, assign severity to problems, and perform postmortem root cause analysis (RCA) after fallbacks. You will also participate in internal and external audits as required by management and work closely with Engineering to ensure all relevant Key Performance Indicators (KPIs) are implemented in the monitoring framework.
Additionally, you will escalate issues to technology, operations, and/or vendors where appropriate, and ensure that database and application controls and procedures remain compliant with Corporate IT risk requirements.
You will also support Disaster Recovery tests and live recovery for all production environments.