PDDN - San Jose, CA

posted 3 months ago

Full-time
San Jose, CA
Professional, Scientific, and Technical Services

About the position

We seek a highly skilled and dynamic Site Reliability Engineer Consultant. In this role, you will maintain and improve the reliability, performance, and availability of software systems. You will act as a bridge between traditional IT operations and software development, bringing a software engineering approach to system administration. Your responsibilities will include creating and supporting automation scripts for infrastructure deployments, validations, and monitoring to improve operational tasks. You will also be involved in scheduling monitoring scripts and utilizing various monitoring tools to ensure system health and performance. This position requires a strong background in IT infrastructure, particularly with Linux systems, and experience in programming and automation tools.

Responsibilities

  • Creating and supporting automation scripts (shell/ansible/python) for infrastructure deployments, validations, and monitoring to improve operational tasks
  • Scheduling monitoring scripts using cron and airflow
  • Monitoring using tools including Dynatrace, Apica, Grafana, etc.
  • Database handling
  • Building CICD pipelines
  • Incident handling and problem management

Requirements

  • Experience in Ansible/Python
  • Monitoring Tools: Dynatrace/Apica/Grafana
  • 14 plus years of IT Infrastructure experience
  • Extensive experience working with Linux flavors like RHEL/CentOS OS, shells, filesystems, and utilities
  • Experience in programming languages like Python, Ansible
  • Knowledge of distributed computing and experience working with container orchestration frameworks including on-prem and Rancher Kubernetes, and good knowledge of Kubernetes objects
  • Experience working with Storage, ONTAP is preferable: volume, aggregates, backups, DR planning
  • Experience scheduling monitoring scripts using cron and airflow
  • Experience with monitoring tools including Dynatrace, Apica, Grafana, etc.
  • Database knowledge including SQL and NoSQL DBs
  • Experience building CICD pipelines (preferred)
  • Cloud platform knowledge (specifically AWS) is required
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service