Oracle - Salt Lake City, UT

posted 3 months ago

Full-time - Mid Level
Publishing Industries

About the position

We are looking for a Site Reliability Engineer (SRE) with 10 years of industry experience to join our team. The SRE will be responsible for ensuring the reliability and availability of our company's production systems. They will work closely with our development team to implement and maintain a high level of system hygiene, and will identify and address potential issues that may impact the performance of our systems.

The ideal candidate will have extensive experience with Linux system administration and with cloud-based technologies such as AWS or GCP. They will also have experience with software development tools and processes, and will be comfortable working with both developers and system administrators. We are looking for a candidate who is passionate about site reliability, willing to take ownership of the performance of our systems, comfortable in a fast-paced environment, and able to quickly identify and address issues.

Responsibilities include designing, developing, and maintaining software applications and infrastructure for Oracle products. The SRE will collaborate with software engineers to identify and resolve issues in software applications, and will develop and maintain automation tools for deployment, monitoring, and maintenance. Working with cross-functional teams to ensure the reliability, scalability, and performance of software applications is a key part of this role.

A postgraduate degree in Computer Science or a related field is required, along with 10+ years of experience in software engineering practices and IT operations tasks, including 5+ years in an SRE role and 5+ years working with SQL and Oracle databases for data exploration and analysis. Strong experience in Oracle Fusion Application Architecture and Operating Procedures is essential, as is experience with data analysis tools like SQL and Python and with cloud infrastructure such as OCI, AWS, or Azure. Strong problem-solving skills and experience with log analysis tools like OCI Log Analytics and Splunk are also required. The candidate should have experience using tools to identify performance issues with applications deployed in WebLogic Servers and strong experience in exploratory data analysis (EDA). US citizenship is required for this position.

Responsibilities

  • Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas.
  • Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.
  • Take responsibility for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance.
  • Serve as the authority for end-to-end performance and operability.
  • Partner with development teams in defining and implementing improvements in service architecture.
  • Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.
  • Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack.
  • Demonstrate clear understanding of automation and orchestration principles.
  • Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
  • Use a deep understanding of service topology and dependencies to troubleshoot issues and define mitigations.
  • Understand and explain the effect of product architecture decisions on distributed systems.
  • Exhibit professional curiosity and a desire to develop a deep understanding of services and technologies.

Requirements

  • Postgraduate degree in Computer Science or a related field.
  • 10+ years of experience in software engineering practices and IT operations tasks.
  • 5+ years in an SRE role.
  • 5+ years working with SQL and Oracle databases for data exploration and analysis.
  • Strong experience in Oracle Fusion Application Architecture and Operating Procedures.
  • Experience with data analysis tools like SQL and Python.
  • Experience with cloud infrastructure such as OCI, AWS, or Azure.
  • Strong problem-solving skills.
  • Strong experience with log analysis tools like OCI Log Analytics and Splunk.
  • Experience using tools to identify performance issues with applications deployed in WebLogic Servers.
  • Strong experience in exploratory data analysis (EDA).
  • US Citizenship required.

Benefits

  • Medical, dental, and vision insurance, including expert medical opinion
  • Short-term and long-term disability
  • Life insurance and AD&D
  • Supplemental life insurance (Employee/Spouse/Child)
  • Health care and dependent care Flexible Spending Accounts
  • Pre-tax commuter and parking benefits
  • 401(k) Savings and Investment Plan with company match
  • Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits.
  • 11 paid holidays
  • Paid sick leave: 72 hours of paid sick leave upon date of hire.
  • Paid parental leave
  • Adoption assistance
  • Employee Stock Purchase Plan
  • Financial planning and group legal
  • Voluntary benefits including auto, homeowner and pet insurance