Full Steam - Boulder, CO

posted 3 months ago

Full-time - Senior
Boulder, CO
1,001-5,000 employees
Professional, Scientific, and Technical Services

About the position

At Fullsteam, we are dedicated to providing innovative software and payment solutions that empower small and medium-sized businesses, particularly in the craft beverage industry. As a Senior Site Reliability Engineer (SRE), you will play a crucial role in ensuring the reliability and performance of our platform. You will collaborate closely with development teams to design, build, and maintain the infrastructure that supports our systems, which are integral to the operations of thousands of craft breweries across the United States. Your expertise will help us enhance the operational capabilities of our software, ensuring that it meets the high standards our customers expect. In this role, you will be responsible for holistically monitoring system health and developing software solutions that improve operational efficiency. You will partner with developer teams to enhance the reliability, quality, and time to market of our point of sale and enterprise resource planning software. Additionally, you will work alongside quality assurance teams to implement rigorous testing and release procedures, ensuring that our software is well-tested before it reaches our customers. The ideal candidate for this position will have extensive experience with cloud infrastructure, distributed systems, and monitoring tools, including Google Cloud Platform (GCP), Kubernetes (K8s), Terraform, and Ansible. You will champion the reliability and scalability of our systems as we introduce new features, ensuring seamless integration and robust performance. Join us at Arryved, part of the Fullsteam organization, and make a direct impact on the craft brewing industry by leveraging your expertise to build and maintain a resilient, high-performance platform.

Responsibilities

  • Enhance the operational capabilities of software powering craft breweries by monitoring system health and developing software solutions.
  • Collaborate with developer teams to improve the reliability, quality, and time to market of point of sale and enterprise resource planning software.
  • Work with quality assurance teams to implement rigorous testing and release procedures for well-tested software.
  • Design, build, and maintain the infrastructure that supports the platform's reliability and performance.
  • Champion the reliability and scalability of systems as new features are introduced.

Requirements

  • Strong proficiency with Linux system administration, including scripting, networking, and automation.
  • Strong proficiency with Terraform, Kubernetes, and Ansible.
  • 3+ years of experience with Google Cloud Platform or AWS.
  • Excellent communication skills, particularly in writing, to facilitate clear communication across a distributed team.
  • Outstanding debugging skills and a desire to understand system operations at every level.
  • A mindset that prioritizes MVP (Minimum Viable Product) principles, emphasizing operational simplicity and efficiency.
  • Excellent stakeholder management skills for effective collaboration across teams.

Benefits

  • Inclusive workplace that values diversity of thought, experience, and background.
  • Equal Opportunity/Affirmative Action employer, ensuring consideration for all qualified applicants without discrimination.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service