Qumulo Careers • Posted 26 days ago
$140,000 - $190,000/Yr
Full-time • Mid Level

About the position

As an SRE at Qumulo, you will help develop solutions to manage and monitor the applications we use internally and to support our customers. We manage our internal build and test infrastructure, which runs multiple builds and hundreds of thousands of tests continuously, both on-prem and in the cloud (such as AWS and Azure Native Qumulo Scalable File Service [ANQ]). This build and test environment is a core part of our engineering processes: it provides continuous feedback to our engineering teams and allows us to deliver new product releases regularly throughout the year. We also build and operate managed components of ANQ, delivering a highly available service to customers and keeping the service up to date with our latest features.

We work across engineering, product, and customer success teams to identify opportunities to improve our processes and to ensure that our existing systems are available and working as expected. We implement automation that reduces work, providing scalable solutions that span our on-prem and cloud environments. We help manage the operating expense of running systems across multiple clouds. We help drive down failures by giving engineers frequent feedback on their changes, backed by high-quality test analytics.

Responsibilities

  • Collaborate with a team that identifies opportunities, plans new features, and implements solutions.
  • Work with team members to build a backlog and deliver solutions iteratively.
  • Troubleshoot build and test failures, diagnosing problems that range from build-time compilation failures to integration test failures involving both virtual machine instances and Qumulo-qualified hardware.
  • Implement monitoring that verifies systems are working as expected and raises alerts when problems are detected.
  • Participate in an on-call rotation to respond to critical incidents affecting the applications the team owns.

Requirements

  • Experience working in Linux (we use Ubuntu).
  • Experience with Python or similar programming languages.
  • Experience with system orchestration tools (such as Ansible, Terraform, and cloud-specific implementations like AWS CloudFormation) is preferred.
  • Experience with one or more of the major cloud providers (AWS, GCP, Azure).
  • Working understanding of Kubernetes and of using containers to manage applications.
  • Experience with monitoring tools and technologies (we use a combination of homegrown solutions that utilize OpenMetrics as well as tools like Grafana, InfluxDB, and Prometheus).
  • Experience troubleshooting systems issues.
  • Knowledge of build automation and test frameworks.

Benefits

  • Annual pay range of USD $140,000.00 - $190,000.00.
  • Excellent healthcare coverage.
  • Parental leave.
  • 401(k) investment plan.
  • Unlimited paid time off; employees are strongly encouraged to take at least 3 weeks per year.

Job Keywords

Hard Skills
  • Ansible
  • AWS CloudFormation
  • InfluxDB
  • Kubernetes
  • Linux