Oracle - Columbus, OH
posted 3 months ago
Oracle's Cloud Infrastructure (OCI) National Security Sector Group is at the forefront of building and operating large-scale distributed infrastructure for the cloud, specifically tailored to meet the needs of government customers. This role is pivotal in supporting Oracle's mission to deliver an enterprise-level cloud infrastructure platform that ensures unmatched reliability, scalability, and performance for critical databases, applications, and workloads. The Autonomous Database Team plays a crucial role in this mission by developing the cloud service framework that powers various Oracle Autonomous Database cloud services, including Autonomous Data Warehouse (ADW) and Autonomous Transaction Processing (ATP). This framework automates the deployment, scaling, and management of databases in the cloud, leveraging Oracle's Cloud Infrastructure (OCI) Layer. As a Site Reliability Engineer (SRE), you will be responsible for defining and deploying autonomous database services with a strong emphasis on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will collaborate with multiple multi-functional teams to deliver exceptional experiences to our collaborators while ensuring the reliability and performance of our services. This role requires a proactive approach to incident management, where you will act as a point of escalation for incidents and other issues arising within the region for cloud database services. You will also be responsible for operating and maintaining cloud database services, deploying code, and implementing changes within the region. In addition to these responsibilities, you will take ownership of the implementation and production operations of a wide array of core system platform solutions. Your role will involve continuously implementing automation, self-healing mechanisms, and real-time monitoring to enhance production systems. Thorough documentation of incidents through company-standard reporting methods is essential, as is staying informed about cloud infrastructure stacks. You will drive and actively participate in resolving complex technical issues that span various services, ensuring that our cloud offerings remain robust and reliable.