DevOps Engineer

$195,000/Yr

Unclassified - New York, NY

posted 4 months ago

Full-time
Remote - New York, NY

About the position

The DevOps Engineer designs, implements, and optimizes DevOps processes within the company. The successful candidate will build and maintain automated deployment and monitoring processes that increase the engineering team's velocity, including automated fault detection infrastructure and systems that run 24x7 with yearly downtime measured in minutes. The role also involves automating operational tasks and proactively identifying and addressing risks. In addition, the candidate will develop statistical and machine learning models for fraud prevention and other use cases, and use data and models to support risk mitigation strategies and interventions while preserving and improving the user experience. Infrastructure work includes managing SardineAI's entire infrastructure with Terraform, migrating existing infrastructure to Kubernetes, and implementing automation tools such as Datadog as code, along with improving the monitoring, resilience, scalability, reliability, and performance of the infrastructure. The candidate will write high-quality code in languages such as Python, Ruby, Scala, and Go, and create simple, reusable code that engineers can use to build dashboards for their teams. Other key responsibilities are building CI/CD pipelines, security controls, and monitoring capabilities, with pipelines built with future-proofing in mind, and improving telemetry tooling, specifically tracing support for the architecture.

Responsibilities

  • Design, implement and optimize DevOps processes within the company.
  • Design, build and maintain automated deployment and monitoring processes to increase the velocity of the engineering team.
  • Design automated fault detection infrastructure and systems that run 24x7, with yearly downtime measured in minutes.
  • Automate operational tasks and proactively identify and address risks.
  • Develop statistical and machine learning models for fraud prevention and other use cases.
  • Use data and models to support the development of risk mitigation strategies and interventions while preserving and improving the user experience.
  • Terraform SardineAI's entire infrastructure.
  • Migrate existing infrastructure to Kubernetes.
  • Implement unique automation tools, such as Datadog as code.
  • Improve the monitoring and resilience of the infrastructure.
  • Enhance the scalability, reliability, and performance of the infrastructure.
  • Write high-quality code in programming languages such as Python, Ruby, Scala, and Go, and create simple, reusable code that engineers can use to build dashboards for their teams.
  • Build CI/CD pipelines, security controls, monitoring capabilities, build configuration, and integration processes, ensuring the CI/CD pipelines are well built and future-proofed.
  • Improve telemetry tooling, specifically tracing support for our architecture.
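
For a sense of scale, "yearly downtime measured in minutes" can be read as an availability target. The listing does not name a specific target, so the percentages below are illustrative assumptions, and the function name `yearly_downtime_minutes` is hypothetical. A minimal sketch of the arithmetic in Python:

```python
# Minutes in a non-leap year: 365 days * 24 hours * 60 minutes = 525,600.
MINUTES_PER_YEAR = 365 * 24 * 60


def yearly_downtime_minutes(availability_percent: float) -> float:
    """Return the allowed downtime, in minutes per year, for an availability target."""
    if not 0.0 <= availability_percent <= 100.0:
        raise ValueError("availability_percent must be between 0 and 100")
    return MINUTES_PER_YEAR * (1.0 - availability_percent / 100.0)


if __name__ == "__main__":
    # Illustrative targets only; the listing does not specify one.
    for target in (99.9, 99.99, 99.999):
        minutes = yearly_downtime_minutes(target)
        print(f"{target}% availability -> {minutes:.2f} minutes/year")
```

Under these assumptions, a 99.99% target leaves a budget of roughly 52.6 minutes per year, while 99.9% leaves about 525.6 minutes (close to nine hours), so "measured in minutes" suggests a target of around four nines or better.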

Requirements

  • Bachelor's degree in Computer Science, Computer Engineering or closely related field, plus one (1) year of experience in developing in Python and Terraform.
  • Experience using Google Cloud Platform and Sentry.
  • Experience utilizing Kubernetes and Helm for deployments.
  • Experience using Linux and Bash.
  • Programming experience in Go.
  • Experience using Git and Docker.
  • Experience using PostgreSQL.
  • Knowledge of CI/CD pipeline structure and workflows.
  • Experience with container orchestration and cloud deployments.
  • Understanding of data structures and algorithms.
  • Experience creating automated workflows.
  • Experience integrating APIs.
  • Knowledge of dashboard architecture.
  • In lieu of a Bachelor's degree, three (3) years of relevant experience may be accepted.

Benefits

  • Telecommuting allowed; the role can be performed 100% remotely from anywhere within the U.S.