Klaviyo - Boston, MA

posted 4 months ago

Full-time - Senior
Boston, MA
Publishing Industries

About the position

As a Lead Site Reliability Engineer joining the SRE team at Klaviyo, you will play a pivotal role in shaping the future of our infrastructure platform. This position involves engaging in strategic discussions to design and evolve our systems, implementing new patterns and standards, and expanding the SRE team. You will work closely with the entire engineering organization to ensure that our infrastructure is robust and scalable. The SRE team is responsible for building foundational backend services and creating tools and automation that enable product teams to release and scale their software reliably and predictably. SREs are collaborative team members who integrate with product teams to enhance the architecture and performance of software systems, while also training peers in critical areas such as debugging distributed systems and developing self-healing applications. In this role, you will provide technical leadership to influence technology choices and architectural decisions across multiple teams. You will design and implement the Embedded SRE program at Klaviyo, focusing on identifying and addressing the most significant infrastructure and reliability challenges faced by engineering teams. By embedding yourself within product engineering teams, you will observe their daily routines, pinpoint sources of stress, and work alongside engineers to implement effective solutions. Additionally, you will mentor engineers to cultivate new technical leaders within the company, fostering a culture of continuous improvement and excellence in engineering practices. The ideal candidate will have extensive experience in architecting and delivering complex systems, with a proven track record of building impactful products. You should be adept at navigating the trade-offs of technical design decisions and possess the ability to lead teams through challenges. A passion for agile development and a commitment to shipping code frequently, while collaborating with product management to enhance software quality, are essential attributes for success in this role.

Responsibilities

  • Provide technical leadership to drive technology choices and architectural decisions across multiple teams.
  • Design and build the Embedded SRE program at Klaviyo, developing processes to discover and solve engineering's biggest infrastructure and reliability pain points.
  • Embed on product engineering teams to observe daily routines, identify stress sources, and work with engineers to implement changes.
  • Mentor multiple engineers to develop new technical leadership within the company.
  • Collaborate with product teams to enhance the architecture and performance of software systems.

Requirements

  • 10+ years of experience in software engineering and architecture.
  • Proven experience in building products that matter and improving engineering practices.
  • Ability to build and scale complex distributed systems sustainably.
  • Experience in navigating technical design trade-offs and risk assessment.
  • Proficiency in agile development and shipping code frequently.

Nice-to-haves

  • Experience with Python, Django, and Celery.
  • Familiarity with MySQL, RabbitMQ, Redis, and Apache Pulsar.
  • Knowledge of Bash, Puppet, and Terraform.
  • Experience with Amazon Web Services (EC2, RDS, Aurora, etc.) and Kubernetes on EKS.

Benefits

  • Medical, dental, and vision coverage
  • Health savings accounts
  • Flexible spending accounts
  • 401(k) plan
  • Flexible paid time off
  • Company-paid holidays
  • Learning allowance
  • Access to professional coaching services
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service