Burgeon IT Services - Sunnyvale, CA

posted 5 days ago

Full-time - Senior
Sunnyvale, CA

About the position

The Senior Site Reliability Engineer (SRE) & DevOps Automation role focuses on keeping systems reliable and performant while automating processes to improve efficiency. It requires a strong background in Site Reliability Engineering and DevOps practices, with particular emphasis on automation tools and cloud platforms. The role is based onsite in Sunnyvale, CA, and is offered as a contract position; it suits professionals with extensive experience at product-based companies.

Responsibilities

  • Ensure the reliability and performance of systems.
  • Automate processes using various tools to enhance operational efficiency.
  • Manage and maintain Linux/Unix systems and their networking.
  • Implement and manage CI/CD pipelines.
  • Utilize cloud platforms such as AWS, Azure, or Google Cloud Platform for deployment and management.
  • Work with containerization technologies like Docker and Kubernetes.
  • Monitor system performance using tools like Prometheus, Grafana, and Datadog.
  • Conduct incident management and perform root cause analysis for system issues.

Requirements

  • 8+ years of experience in Site Reliability Engineering (SRE) or DevOps Automation.
  • Strong experience with Linux/Unix systems and networking concepts.
  • Expertise in automation tools such as Terraform, Ansible, Chef, or Puppet.
  • Proficiency with cloud platforms such as AWS, Azure, or Google Cloud Platform.
  • Experience with containerization technologies (e.g., Docker, Kubernetes, Helm).
  • Deep knowledge of CI/CD pipelines and related tooling (e.g., Jenkins, GitLab CI, CircleCI).
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, Datadog, ELK stack).
  • Strong programming skills in Python, Go, or Shell scripting.
  • Excellent understanding of system design, microservices architecture, and high-availability systems.
  • Experience in incident management and ability to perform root cause analysis.
  • Familiarity with agile methodologies and fast-paced development environments.