This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

Canonical Group - Oklahoma City, OK

posted about 2 months ago

Full-time - Mid Level
Remote - Oklahoma City, OK
Professional, Scientific, and Technical Services

About the position

The Site Reliability / Gitops Engineer role at Canonical is a hands-on position focused on driving operations automation and improving infrastructure as code practices. The engineer will work within the IS team to support and maintain Canonical's IT production services, which serve over 60 million Ubuntu users. This position involves collaborating with development teams, automating software operations, and enhancing the resilience and scalability of cloud and container services.

Responsibilities

  • Apply experience of Infrastructure as Code (IaC) to develop practices within IS by increasing automation and improving IaC processes.
  • Automate software operations for re-usability and consistency across private and public clouds, considering complexities of distributed systems.
  • Develop new features and improve the resilience and scalability of Canonical's cloud and container portfolio.
  • Maintain operational responsibility for all of Canonical's core services, networks, and infrastructure.
  • Develop skills in troubleshooting, capacity planning, and performance investigation; set up and maintain observability tools such as Prometheus, Grafana, and Elasticsearch.
  • Collaborate with development teams to design service architecture, documentation, playbooks, policies, and operational procedures.
  • Provide assistance and work with globally distributed engineering, operations, and support peers.
  • Focus on larger projects and automation of manual tasks during uninterrupted development time.
  • Share experience, know-how, and best practices with team members in design sessions and mentorship.
  • Carry final responsibility for time-critical escalations.

Requirements

  • Deep experience in defining operations in code using version control, peer review, and CI/CD for rolling out changes to applications and infrastructure.
  • Strong modern engineering background including peer-review, unit testing, SCM, CI/CD, and Agile methodologies.
  • Python software development experience with large projects.
  • Practical knowledge of Linux networking, routing, and firewalls.
  • Affinity with various forms of Linux storage, from Ceph to databases.
  • Hands-on experience administering enterprise Linux servers.
  • Extensive knowledge of cloud computing concepts and technologies.
  • Bachelor's degree or greater, preferably in computer science or a related engineering field.
  • Ability to communicate clearly and effectively in English across various mediums.
  • Motivated to troubleshoot from kernel to web and willing to seek help when necessary.
  • Willingness to be flexible and learn new things quickly.
  • Ability to adapt to fast-changing environments and work within distributed teams.
  • Passion for and familiarity with open-source technologies, especially Ubuntu or Debian.

Nice-to-haves

  • Experience with CI/CD pipelines and DevOps practices.
  • Familiarity with container orchestration tools like Kubernetes.
  • Knowledge of security best practices in cloud environments.

Benefits

  • Fully remote working environment.
  • Personal learning and development budget of 2,000 USD per annum.
  • Annual compensation review.
  • Recognition rewards.
  • Annual holiday leave.
  • Parental Leave.
  • Employee Assistance Programme.
  • Opportunity to travel to new locations to meet colleagues at 'sprints'.
  • Priority Pass for travel and travel upgrades for long haul company events.
Job Description Matching

Match and compare your resume to any job description

Start Matching
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service