AWS DevOps Engineer

$104,000 - $124,800/Yr

Savvyan Technologies - Palo Alto, CA

posted 6 months ago

Full-time
Palo Alto, CA
Administrative and Support Services

About the position

As an AWS DevOps Engineer at Savvyan Technologies, you will play a crucial role in championing site reliability culture and practices within your team. Your primary responsibility will be to lead initiatives aimed at enhancing the reliability and stability of applications and platforms. This will involve utilizing data-driven analytics to improve service levels and collaborating with team members to identify comprehensive service level indicators. You will work closely with stakeholders to establish reasonable service level objectives and error budgets, ensuring that customer expectations are met effectively. In this role, you will demonstrate a high level of technical expertise in one or more domains, proactively identifying and resolving technology-related bottlenecks. You will act as the main point of contact during major incidents, showcasing your ability to quickly identify and solve issues to prevent financial losses. Additionally, you will be responsible for documenting knowledge within the organization through internal forums and communities of practice, contributing to a culture of continuous learning and improvement. Your experience and skills will be critical in implementing site reliability best practices, focusing on reliability, scalability, performance, security, and toil reduction. You will leverage your proficiency in programming languages and observability tools to enhance the overall performance of the applications and platforms you manage.

Responsibilities

  • Demonstrates and champions site reliability culture and practices and exerts technical influence throughout your team.
  • Leads initiatives to improve the reliability and stability of applications and platforms using data-driven analytics.
  • Collaborates with team members to identify comprehensive service level indicators and stakeholders to establish reasonable service level objectives and error budgets with customers.
  • Acts as the main point of contact during major incidents for applications, identifying and solving issues quickly to avoid financial losses.
  • Documents knowledge within the organization via internal forums and communities of practice.

Requirements

  • Formal training or certification on Software engineering concepts and 5+ years of applied experience.
  • Deep proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices.
  • Fluency in at least one programming language such as Python, Java Spring Boot, or .Net.
  • Proficiency and experience in observability tools such as Grafana, Dynatrace, Prometheus, Datadog, and Splunk.
  • Proficiency in continuous integration and continuous delivery tools such as Jenkins, GitLab, and Terraform.
  • Experience with container and container orchestration tools such as ECS, Kubernetes, and Docker.

Nice-to-haves

  • Experience with infrastructure as code tools such as Terraform.
  • Experience managing/supporting Cloud-based applications, preferably AWS.
  • Excellent communication skills.
  • Background in Fin-tech may be helpful.
  • Troubleshooting common networking technologies and issues.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service