Dendi - Cambridge, MA

posted 10 days ago

Full-time - Senior
Cambridge, MA
Professional, Scientific, and Technical Services

About the position

The Site Reliability Engineer (SRE) role at Dendi focuses on ensuring the reliability, availability, and performance of applications and systems. The position involves designing, building, and maintaining scalable infrastructure, primarily on AWS, while fostering a collaborative DevOps culture within the team. The SRE is responsible for deploying Docker containers, using Infrastructure as Code (IaC) tools, and managing CI/CD pipelines, all in line with security best practices.

Responsibilities

  • Ensure the reliability, availability, and performance of applications and systems.
  • Design, build, and maintain scalable and efficient infrastructure using AWS services.
  • Deploy Docker containers in production environments.
  • Utilize Infrastructure as Code (IaC) tools such as Terraform, Ansible, and Packer for automation.
  • Develop and manage complex CI/CD pipelines using tools such as GitLab CI and GitHub Actions.
  • Code in Python or Ruby and script in Shell (Bash) for automation and integration tasks.
  • Architect, support, and deploy large-scale systems from scratch.
  • Implement best practices for massive-scale data ingestion and messaging systems.
  • Work closely with developers and operations teams, fostering a true DevOps culture.
  • Document infrastructure and processes extensively, ensuring information is centralized in the company's knowledge base.
  • Communicate clearly in written form, using Slack, tickets, and documentation to share knowledge.
  • Stay updated on security best practices and ensure the infrastructure is secure.
  • Encourage collaboration and empower colleagues by sharing knowledge and offering feedback.

Requirements

  • Fluent written and verbal communication in English.
  • 7 years of experience as an SRE, with diverse infrastructure experience.
  • In-depth knowledge of AWS services and 5+ years of hands-on experience deploying them in production environments.
  • Experience with Docker containers, Infrastructure as Code tools (Terraform, Ansible, Packer), and CI/CD pipelines (GitLab CI, GitHub Actions).
  • Deep SRE expertise to draw on when making executive decisions.
  • Strong coding skills in Python or Ruby and scripting in Shell (Bash).
  • Thorough understanding of the Software Development Lifecycle (SDLC).
  • Proven experience in architecting, supporting, and deploying large-scale systems from scratch.
  • Familiarity with massive-scale data ingestion and messaging systems.
  • Strong written communication skills and a commitment to documentation.
  • Curiosity and a proactive approach to problem-solving and learning new technologies.
  • Ability to work collaboratively in a remote-first environment.
  • Security-conscious mindset and up-to-date knowledge of security standards.

Benefits

  • Unlimited PTO.
  • Observance of local holidays (no work on most U.S. holidays).
  • A collaborative work environment where you can grow your skills and career.
  • Opportunities to work on diverse projects and technologies.