Site Reliability Engineer

$104,000 - $114,400/Yr

Indotronix International Corporation - Phoenix, AZ

posted 17 days ago

Full-time - Mid Level
Phoenix, AZ
Professional, Scientific, and Technical Services

About the position

The Site Reliability Engineer (SRE) role at Indotronix focuses on applying software engineering principles to enhance the reliability of systems and operations. The position involves monitoring systems, automating tasks, and collaborating with various teams to improve performance and resolve incidents. This role is essential in bridging the gap between infrastructure and application teams, ensuring operational excellence and reliability in a fast-paced environment.

Responsibilities

  • Monitor systems and infrastructure to maintain operational and performance levels
  • Rotational on-call responsibilities
  • Collaborate with other SRC professionals/engineers during incidents and provide consultation/resolution
  • Anticipate potential problems and collaborate on solutions
  • Gather and analyze metrics from tools and logs for performance tuning and fault resolution
  • Create sustainable systems through automation and process enhancement
  • Build automation to manage SRC operations and minimize manual functions
  • Engage in post-incident reviews for improvements and cause determination
  • Act as the main point of contact for running incident management calls

Requirements

  • Bachelor's degree in Engineering, Computer Science, or related field (or equivalent experience)
  • 6 years' experience supporting a large enterprise center
  • Strong troubleshooting and problem-solving skills
  • Working knowledge of Linux and Windows server administration
  • Experience with VCE/UCP and VMware versions 6 and above
  • Knowledge of CIFS/NFS, DPA reporting, and Avamar administration
  • Familiarity with middleware technologies like WebSphere, Apache, and IIS
  • Understanding of networking protocols and OSI Model
  • Proficiency in ITSM processes and operations analytics methodologies
  • ITIL fundamentals knowledge

Nice-to-haves

  • Network+ Certification
  • Experience with ServiceNow, TrueSight, Jira, and Confluence
  • Adaptability to prioritize critical incidents in a high-volume environment
  • Strong communication and interpersonal skills
  • Self-motivated with the ability to work independently or in a team

Benefits

  • Competitive salary
  • Contract to hire opportunity
  • Training provided for 2-3 weeks
  • Flexible working hours
  • Inclusive workplace culture
  • Commitment to pay equity
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service