SRE Lead - W2 Contract

$113,693 - $149,760/Yr

Persistent Systems Ltd. - Phoenix, AZ

posted 4 days ago

Full-time - Mid Level
Hybrid - Phoenix, AZ
Professional, Scientific, and Technical Services

About the position

The Site Reliability Engineer (SRE) Lead position at Persistent Systems involves overseeing the reliability and performance of systems and infrastructure. This role emphasizes a software engineering approach to operations, focusing on automation, problem-solving, and collaboration between infrastructure and application teams. The SRE Lead will be responsible for monitoring systems, managing on-call duties, and driving continuous improvement in operational processes.

Responsibilities

  • Monitor systems and infrastructure to maintain operational and performance levels
  • Handle rotational on-call responsibilities
  • Collaborate with other SRE professionals/engineers during incidents and provide consultation for resolution
  • Anticipate potential problems and collaborate on solutions
  • Gather and analyze metrics from tools and logs for performance tuning and fault resolution
  • Create sustainable systems through automation and process enhancements
  • Build automation to manage operations and reduce manual tasks
  • Engage in post-incident reviews to improve processes and prevent recurrence

Requirements

  • Bachelor's degree in Engineering, Computer Science, or related field (or equivalent experience)
  • 8+ years of experience in an engineering role, with at least 2 years in a lead position
  • Strong knowledge of Linux and Windows server administration and troubleshooting
  • Experience with VCE/UCP, VMware, and network connectivity
  • Familiarity with CIFS/NFS, DPA reporting, and Data Domain administration
  • Proficient in middleware technologies such as WebSphere, Apache, IIS, WebLogic, and Tomcat
  • Understanding of networking protocols and OSI Model
  • Experience with ServiceNow, TrueSight, Jira, and Confluence
  • Knowledge of ITSM processes and operations analytics methodologies
  • Strong troubleshooting and problem-solving skills

Nice-to-haves

  • CompTIA Network+ certification
  • Experience with scripting in PowerShell and Bash
  • Familiarity with ITIL fundamentals
  • Experience in a high-volume incident management environment

Benefits

  • Flexible schedule
  • Health insurance
  • Life insurance
  • Paid time off
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service