Arista Networksposted 28 days ago
OR

About the position

Arista Networks is looking for Site Reliability Engineers to play an active role and have a high impact in the early rollout of both internal and customer-facing services making key architecture decisions, and designing and implementing best practices in advancing the Software Defined Networking revolution in the cloud. The Site Reliability Engineering (SRE) role combines software and systems engineering to build and run high performance, massively distributed, robust systems. The role is key in optimizing our system capacity and performance at all times. SRE roles at Arista are generally in one of two areas: Internal Tools: Designing and Operating our internal systems including CI/CD pipelines as well as source repos and other internal tools. External SaaS: An active role with a high impact on a cloud-based public SaaS across all Arista teams. Both roles have the freedom to push the envelope forward in terms of quality and availability while designing, choosing, and building their own best practices and tools to make that happen.

Responsibilities

  • Engage in and improve the whole lifecycle of services—from inception and design, deployment, operation, and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
  • Scale systems sustainably through mechanisms like automation; evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.

Requirements

  • Bachelor's degree in Computer Science, a related technical field involving software/systems engineering, or equivalent practical experience.
  • Experience programming in the following languages: Go and Python.
  • Experience in operating a cloud-based SaaS.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Experience with Jenkins, Docker, K8s.
  • Ability to debug, optimize code, and automate routine tasks.
  • Understanding of Unix/Linux operating systems.

Job Keywords

Hard Skills
  • Docker
  • Go
  • Jenkins
  • Linux
  • Operating Systems
  • 0OJxuPQj ZOzlw71Sp
  • 2Rv3PAi zBAjMlhEW8ftJ
  • 7aeV2fX8S jxB4Iy39i
  • 7VrGkld0HUjX zsQc2a1bKnAX
  • 9KBfechtJGln 49HgpdInl6qa
  • 9ptNrEg VSiD9
  • BHJR8L1St pQHMeu3j9FfJ
  • DbJu5qrCN VhGlakyL
  • KSPRH
  • KuLRfAUEy BNYGkpaA 2aBgN6AGhm1
  • lIUL0E bRjmSfWHKkU
  • nBPwjLm
  • OeBfU3
  • OINtsPJ dHVqgT1J
  • SVWOdsuPthYb T5zyqrWS
  • u9MO0cBmHLyTs s6qVRNJKX3a
  • WLIzQahwG 0Q7bPXjYV
  • yX4A6uGC wZD71YPOR4gh
Build your resume with AI

A Smarter and Faster Way to Build Your Resume

Go to AI Resume Builder
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service