The Judge Group - Phoenix, AZ

posted 7 days ago

Full-time - Entry Level
Phoenix, AZ
Administrative and Support Services

About the position

The Site Reliability Engineer (SRE) role is focused on maintaining operational and performance levels of systems and infrastructure. This position involves monitoring, troubleshooting, and collaborating with other professionals to ensure system reliability and efficiency. The SRE will engage in automation, process enhancement, and post-incident reviews to improve system performance and prevent future issues.

Responsibilities

  • Monitor systems and infrastructure to maintain operational and performance levels
  • Rotational on-call responsibilities
  • Work closely with other SRC professionals/engineers when issues arise, collaborate on troubleshooting, and provide consultation/resolution with events/incidents
  • Anticipate potential problems before they become impacting and collaborate to determine solutions
  • Gather and analyze metrics from tools and system/application logs to assist in performance tuning, fault finding, and resolution
  • Create sustainable systems and services through automation, processes enhancement, tools, and noise reduction
  • Build automation to manage the SRC operations and eliminate/minimize manual functions and toil
  • Collaborate with Application/Infrastructure support engineers and operations teams
  • Engage in post-incident reviews for improvements and determining the cause to prevent recurrence

Requirements

  • Bachelor's degree in Engineering, Computer Science, or related field required (or equivalent experience)
  • 2 years experience supporting a large enterprise center
  • Possess a breadth and depth of technical and management knowledge
  • Continuous improvement mindset, always looking for opportunities to streamline, routinize, or automate
  • Working knowledge across technology support areas including Server, Converged Solutions, Storage, Middleware, Mainframes, Networking, Workflow and Knowledge Management, and Collaboration Tools
  • Strong troubleshooting and problem-solving skills, with the ability to analyze and resolve complex technical issues
  • Familiarity with ITIL fundamentals and ITSM processes

Nice-to-haves

  • Network+ Certification
  • Experience with ServiceNow, TrueSight, Jira, and Confluence
  • Proficiency in operations analytics methodologies to drive performance improvement (e.g., Lean)
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service