Pyramid Consulting - Phoenix, AZ

posted 7 days ago

Full-time - Mid Level
Phoenix, AZ
Professional, Scientific, and Technical Services

About the position

The Lead Site Reliability Engineer will be responsible for monitoring systems and infrastructure to maintain operational and performance levels. This role requires a strong technical background and management knowledge, with a focus on continuous improvement and automation within a large enterprise data center environment.

Responsibilities

  • Rotational on-call responsibilities
  • Work closely with other SRC professionals/engineers when issues arise, collaborate on troubleshooting, and provide consultation/resolution with events/incidents
  • Anticipate potential problems before they become impacting and collaborate to determine solutions
  • Gather and analyze metrics from tools and system/application logs to assist in performance tuning, fault finding, and resolution
  • Create sustainable systems and services through automation, processes enhancement, tools, and noise reduction
  • Build automation to manage the SRC operations and eliminate/minimize manual functions and toil
  • Collaborate with Application/Infrastructure support engineers and operations teams
  • Engage in post-incident reviews for improvements and determining the cause to prevent recurrence

Requirements

  • 8+ years of experience in Site Reliability Engineering
  • 2 years experience supporting a large enterprise data center
  • Strong troubleshooting and problem-solving skills
  • Bachelor's degree in Engineering, Computer Science, or related field required (or equivalent experience)
  • Working knowledge in server administration and troubleshooting in Linux and Windows
  • Experience with converged solutions, specifically VCE/UCP and VMWare versions 6 and above
  • Knowledge of storage solutions including CIFS/NFS, Avamar, and Data Domain administration
  • Familiarity with middleware technologies such as WebSphere, Apache, IIS, WebLogic, and Tomcat
  • Understanding of mainframe technologies including JCL and CICS SYSPLEX
  • Strong understanding of network protocols and OSI Model, Network+ Certification preferred
  • Proficiency in ITSM processes and operations analytics methodologies (e.g., Lean)
  • Familiarity with ServiceNow, TrueSight, Jira, and Confluence

Benefits

  • Contract to hire opportunity
  • Hybrid work environment in Phoenix
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service