Cribl - Springfield, IL

posted 2 months ago

Full-time - Senior
Remote - Springfield, IL

About the position

Cribl is seeking a Staff Site Reliability Engineer (SRE) to enhance the reliability and performance of its observability data solutions. This remote role involves collaborating with various teams to improve service delivery, monitor production systems, and drive operational excellence. The ideal candidate will have a strong background in DevOps or SRE practices, with a focus on continuous delivery and cloud technologies.

Responsibilities

  • Engage with teams to improve service delivery and reliability across their entire lifecycle.
  • Measure and monitor all production systems focusing on availability, latency, and overall system health.
  • Identify the causes of errors and instability in production cloud services and drive teams towards operational excellence.
  • Collaborate with product and platform teams to enhance systems by advocating for changes that improve reliability, resilience, and observability.
  • Identify and reduce toil through creative innovation and automation.
  • Participate in on-call responsibilities.

Requirements

  • Extensive experience with enterprise scale continuous delivery environments.
  • 8+ years of experience in a DevOps or SRE role.
  • Development experience with JavaScript/Node.js/TypeScript in a Linux/Mac environment.
  • Experience with Configuration Management Tools like Terraform, Puppet, Chef, or Ansible.
  • Experience with sustainable incident response in a blameless environment.
  • Knowledge of cloud platforms, preferably AWS, and container orchestration technologies.
  • Experience with APM and Observability tools such as New Relic, Splunk, CloudWatch, Prometheus, Grafana/Kibana, Sentry, etc.
  • Background in Linux Systems Engineering.
  • Experience with incident response tools like PagerDuty, FireHydrant, or Blameless.
  • Ability to work autonomously in a distributed team.

Nice-to-haves

  • Knowledge of Cloud and application security.
  • Strong knowledge of cloud design patterns for scale, data management, and resiliency.
  • A passion for high quality and testing.
  • Opinions about dashboards, metrics, and SLOs.

Benefits

  • Health insurance
  • Dental insurance
  • Vision insurance
  • Short-term disability insurance
  • Life insurance
  • Paid holidays
  • Paid time off
  • Fertility treatment benefit
  • 401(k) plan
  • Equity options
  • Eligibility for a discretionary company-wide bonus
Job Description Matching

Match and compare your resume to any job description

Start Matching
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service