ServiceNow - San Diego, CA

posted about 2 months ago

Full-time - Mid Level
San Diego, CA
5,001-10,000 employees
Professional, Scientific, and Technical Services

About the position

The Senior Linux System Administrator will play a crucial role in the administration and operations of ServiceNow's global cloud infrastructure, specifically supporting US Federal customers. This position involves working closely with engineers and developers to ensure the availability and efficiency of the server infrastructure that runs the SaaS platform. The role includes responsibilities in configuration management, automation, troubleshooting, and preparing new products for production readiness.

Responsibilities

  • Contribute to Configuration Management and Infrastructure as Code for ServiceNow's global private cloud.
  • Develop tools in Python, bash, and JavaScript to replace manual work and improve customer maintenance experience.
  • Drive enhancements and bugfixes for large scale automation projects such as patching, provisioning, and kickstart domains.
  • Design and implement procedures for maintenance where automation cannot be applied; drive resolution of root causes with internal team members.
  • Prepare new ServiceNow products and services for production readiness with design review, feedback to engineering teams, training, and testing.
  • Use broad knowledge and experience of systems administration and networking principles to proactively prevent and address incidents while constantly improving documentation.
  • Participate in escalations and Root Cause Analysis of issues in both US Federal and global Commercial infrastructures.
  • Troubleshoot database backup and restore failures as well as perform database migrations.
  • Support operation of a wide variety of infrastructure services including Machine Learning and Prediction, Cloudera Big Data clusters, Kafka and RabbitMQ messaging, database encryption, E-Mail infrastructure at scale, DNS, Puppet, Elasticsearch, F5 BigIP, and more.

Requirements

  • Expert-level skills and background in systems administration and engineering.
  • Strong Linux expertise, specifically with RedHat and/or CentOS.
  • 4+ years of experience with Linux.
  • Experience with performance and availability monitoring, analysis, and configuration management platforms (e.g. Nagios/Icinga, Cacti, Ansible, Puppet, cfengine, chef, Splunk, Logstash).
  • Working level knowledge of one: Perl, Python, JavaScript.
  • Familiarity with MySQL, Oracle, MariaDB, or similar technologies; proficiency preferred.
  • Expert-level skills and experience with service troubleshooting in a production environment covering web front-end, Systems, Databases and Networks.
  • Familiarity with Networking Technologies such as routing, switching and load balancing; F5 and NGINX experience is ideal.
  • Understanding of ITIL v3 framework and how it applies to incidents, problems and change.
  • Good communication skills and ability to work well in a collaborative team environment.

Nice-to-haves

  • Prior experience in Site Reliability Engineering/DevOps and managing large-scale server infrastructure at a cloud computing or MSP setting is highly desirable.

Benefits

  • Health plans including flexible spending accounts
  • 401(k) Plan with company match
  • Employee Stock Purchase Plan (ESPP)
  • Matching donations
  • Flexible time away plan
  • Family leave programs
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service