This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

Canonical Group - Salt Lake City, UT

posted about 2 months ago

Full-time - Mid Level
Remote - Salt Lake City, UT
Professional, Scientific, and Technical Services

About the position

The Site Reliability / GitOps Engineer role at Canonical is an opportunity for a hands-on technologist who is passionate about Linux and open source products. The position focuses on driving operations automation within Canonical's IT production services, which support over 60 million Ubuntu users. The engineer will apply Infrastructure as Code (IaC) practices, collaborate with development teams, and improve Canonical's products through critical feedback and contributions. The role emphasizes automation, resilience, and scalability in cloud and container environments, and includes mentoring and supporting colleagues within a global team of Site Reliability Engineers.

Responsibilities

  • Apply IaC experience to develop the infrastructure-as-code practice within IS by increasing automation and improving IaC processes.
  • Automate software operations for reusability and consistency across private and public clouds, accounting for the complexities of distributed systems.
  • Develop new features and improve the resilience and scalability of Canonical's cloud and container portfolio.
  • Maintain operational responsibility for all of Canonical's core services, networks, and infrastructure.
  • Develop skills in troubleshooting, capacity planning, and performance investigation; set up and maintain observability tools such as Prometheus, Grafana, and Elasticsearch.
  • Collaborate with development teams to design service architecture, documentation, playbooks, policies, and operational procedures.
  • Provide assistance and work with globally distributed engineering, operations, and support peers.
  • Focus on larger projects and automation of manual tasks during uninterrupted development time.
  • Share experience, know-how, and best practices with team members in design sessions, mentorship, and collaborative work.
  • Carry final responsibility for time-critical escalations.

Requirements

  • Deep experience in defining operations in code using version control, peer review, and CI/CD for application and infrastructure changes.
  • Strong modern engineering background, including peer review, unit testing, SCM, CI/CD, and Agile methodologies.
  • Python software development experience with large projects.
  • Practical knowledge of Linux networking, routing, and firewalls.
  • Affinity with various forms of Linux storage, from Ceph to databases.
  • Hands-on experience administering enterprise Linux servers.
  • Extensive knowledge of cloud computing concepts and technologies.
  • Bachelor's degree or higher in computer science or a related engineering field.
  • Clear and effective communication skills in English across various media.
  • Motivated to troubleshoot from kernel to web and willing to seek help when necessary.
  • Flexibility and quick learning ability in fast-changing environments.
  • Ability to work within distributed teams and a passion for open source, especially Ubuntu or Debian.

Benefits

  • Fully remote working environment
  • Personal learning and development budget of 2,000 USD per annum
  • Annual compensation review
  • Recognition rewards
  • Annual holiday leave
  • Parental leave
  • Employee Assistance Programme
  • Opportunity to travel to new locations to meet colleagues at 'sprints'
  • Priority Pass for travel and travel upgrades for long-haul company events
© 2024 Teal Labs, Inc