Cribl - Denver, CO
posted 5 months ago
Cribl is on a mission to unlock the value of all observability data, and we are looking for a Staff Site Reliability Engineer (SRE) to join our team. As a remote-first company, we empower our employees to do their best work from anywhere. In this role, you will be part of a collaborative and motivated team that is passionate about putting customers first and delivering high-quality software. You will engage with various teams to improve service delivery and reliability across the entire lifecycle of our products. Your contributions will help shape the future of our technology, allowing customers to have full control over their observability data. As a Staff SRE, you will be involved in all aspects of our systems, from conception and design to development and production. You will measure and monitor production systems, focusing on availability, latency, and overall system health. Your role will also involve seeking out the causes of errors and instability in our cloud services and driving teams towards operational excellence. You will work closely with product and platform teams to lobby for changes that enhance reliability, resilience, and observability. Additionally, you will help identify and reduce toil through creative innovation and automation, and you will have on-call responsibilities as part of your role. This position is ideal for someone who has extensive experience in enterprise-scale continuous delivery environments and a strong background in DevOps or SRE practices. If you are passionate about reliability and have strong opinions on how to improve systems, we want to hear from you!