Oracle - Columbus, OH

posted 3 months ago

Full-time - Mid Level
Publishing Industries

About the position

Oracle's Cloud Infrastructure (OCI) National Security Sector Group is at the forefront of building and operating large-scale distributed infrastructure for the cloud, specifically tailored to meet the needs of government customers. This role is pivotal in supporting Oracle's mission to deliver an enterprise-level cloud infrastructure platform that ensures unmatched reliability, scalability, and performance for critical databases, applications, and workloads.

The Autonomous Database Team plays a crucial role in this mission by developing the cloud service framework that powers various Oracle Autonomous Database cloud services, including Autonomous Data Warehouse (ADW) and Autonomous Transaction Processing (ATP). This framework automates the deployment, scaling, and management of databases in the cloud, leveraging the OCI layer.

As a Site Reliability Engineer (SRE), you will be responsible for defining and deploying autonomous database services with a strong emphasis on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will collaborate with multiple multi-functional teams to deliver exceptional experiences to our collaborators while ensuring the reliability and performance of our services.

This role requires a proactive approach to incident management: you will act as a point of escalation for incidents and other issues arising within the region for cloud database services. You will also operate and maintain cloud database services, deploy code, and implement changes within the region. In addition, you will take ownership of the implementation and production operations of a wide array of core system platform solutions, continuously implementing automation, self-healing mechanisms, and real-time monitoring to enhance production systems.

Thorough documentation of incidents through company-standard reporting methods is essential, as is staying informed about cloud infrastructure stacks. You will drive and actively participate in resolving complex technical issues that span various services, ensuring that our cloud offerings remain robust and reliable.

Responsibilities

  • Act as a point of escalation for incidents and other issues arising within the region for cloud database services.
  • Operate and perform maintenance on cloud database services running within the region.
  • Deploy code and implement other changes within the region.
  • Take ownership of the implementation and production operations of a wide array of core system platform solutions.
  • React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring for production systems.
  • Ensure thorough documentation of incidents through company-standard reporting methods.
  • Stay informed about cloud infrastructure stacks.
  • Drive and actively participate in the resolution of complex technical issues spanning various services.

Requirements

  • Ability to maintain a US government security clearance.
  • At least a Bachelor's degree in Computer Science, MIS, or another technical field, or equivalent work experience.
  • Solid experience with Linux.
  • Experience troubleshooting complex software and/or networking issues.
  • Solid understanding of cloud concepts and platforms.
  • Expert-level experience in understanding, implementing, and troubleshooting Oracle Database technology, including RAC, Data Guard, ASM, and RMAN (preferred).
  • Development skills in Python, shell, and SQL.
  • Expert knowledge and in-depth experience of Oracle Engineered Systems and subsystems, especially Exadata.
  • Ability to troubleshoot and resolve complex hardware/software issues, restore environments to an operational state, perform root cause analysis, and provide forward-thinking mitigation strategies.
  • Good communication and analytical skills.
  • Familiarity with security practices in web application delivery and general knowledge of network topology.
  • Demonstrable ability to quickly learn new technical domains and then train others.

Nice-to-haves

  • Experience in cloud technical support, operations, a NOC, or a similar function is preferred, but not required.
  • Experience working with government customers is preferred, but not required.

Benefits

  • Medical, dental, and vision insurance, including expert medical opinion.
  • Short-term and long-term disability.
  • Life insurance and AD&D.
  • Supplemental life insurance (Employee/Spouse/Child).
  • Health care and dependent care Flexible Spending Accounts.
  • Pre-tax commuter and parking benefits.
  • 401(k) Savings and Investment Plan with company match.
  • Flexible vacation policy with accrued vacation based on hours worked.
  • 11 paid holidays.
  • Paid sick leave: 72 hours upon date of hire, refreshing each calendar year.
  • Paid parental leave.
  • Adoption assistance.
  • Employee Stock Purchase Plan.
  • Financial planning and group legal services.
  • Voluntary benefits including auto, homeowner, and pet insurance.