This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

Canonical Group - Chicago, IL

posted about 2 months ago

Full-time - Mid Level
Remote - Chicago, IL
Professional, Scientific, and Technical Services

About the position

The Site Reliability / Gitops Engineer role at Canonical is a hands-on position focused on driving operations automation and enhancing the infrastructure as code (IaC) practices within the IS team. This team supports Canonical's IT production services, which are utilized by over 60 million Ubuntu users. The engineer will work with open-source technologies, CI/CD pipelines, and collaborate with development teams to improve Canonical products and services. The role emphasizes automation, scalability, and operational responsibility for core services, while also providing opportunities for personal development and collaboration within a global team.

Responsibilities

  • Apply experience of IaC to develop infrastructure as code practice within IS by increasing automation and improving IaC processes.
  • Automate software operations for re-usability and consistency across private and public clouds.
  • Develop new features and improve the resilience and scalability of the existing cloud and container portfolio at Canonical.
  • Maintain operational responsibility for all of Canonical's core services, networks, and infrastructure.
  • Develop skills in troubleshooting, capacity planning, and performance investigation, using observability tools such as Prometheus, Grafana, and Elasticsearch.
  • Collaborate with development teams to design service architecture, documentation, playbooks, policies, and operational procedures.
  • Provide assistance and work with globally distributed engineering, operations, and support peers.
  • Share experience, know-how, and best practices with team members in design sessions and mentorship.
  • Carry final responsibility for time-critical escalations.

Requirements

  • Deep experience of operations in code, using version control, peer review, and CI/CD to roll out changes to applications and infrastructure.
  • Strong modern engineering background including peer-review, unit testing, SCM, CI/CD, and Agile methodologies.
  • Python software development experience with large projects.
  • Practical knowledge of Linux networking, routing, and firewalls.
  • Affinity with various forms of Linux storage, from Ceph to Databases.
  • Hands-on experience administering enterprise Linux servers.
  • Extensive knowledge of cloud computing concepts and technologies.
  • Bachelor's degree or greater, preferably in computer science or related engineering field.
  • Ability to communicate clearly and effectively in English over various mediums.
  • Motivated and able to troubleshoot from kernel to web, willing to ask others when appropriate.
  • Willingness to be flexible and learn new things quickly.
  • Ability to work within distributed teams and be inspired by fast-changing environments.
  • Passion for open-source, especially Ubuntu or Debian.

Nice-to-haves

  • Experience with observability tools like Prometheus, Grafana, and Elasticsearch.
  • Familiarity with CI/CD tools and practices.
  • Experience in mentoring or leading teams.

Benefits

  • Fully remote working environment
  • Personal learning and development budget of 2,000 USD per annum
  • Annual compensation review
  • Recognition rewards
  • Annual holiday leave
  • Parental Leave
  • Employee Assistance Programme
  • Opportunity to travel to new locations to meet colleagues at 'sprints'
  • Priority Pass for travel and travel upgrades for long haul company events
Job Description Matching

Match and compare your resume to any job description

Start Matching
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service