Unclassified - New York, NY
posted 4 months ago
The position involves designing, implementing, and optimizing DevOps processes within the company. The successful candidate will create and maintain automated deployment and monitoring processes that increase the velocity of the engineering team. This includes designing automated fault detection infrastructure and systems that run continuously, with downtime measured in minutes over the course of a year. The role also requires automating operational tasks while proactively identifying and addressing potential risks.

In addition, the candidate will develop statistical and machine learning models for fraud prevention and other relevant use cases, and will use data and models to support risk mitigation strategies and interventions that preserve and improve the user experience.

Responsibilities include Terraforming SardineAI's entire infrastructure, migrating existing infrastructure to Kubernetes, and implementing automation tooling such as Datadog as code. The position also focuses on improving the monitoring and resilience of the infrastructure, as well as its scalability, reliability, and performance.

The candidate will be expected to write high-quality code in various programming languages, including Python, Ruby, Scala, and Go, and to create simple, reusable code that engineers can use to build dashboards for their teams. Other key responsibilities are building CI/CD pipelines, security controls, and monitoring capabilities, and ensuring that these pipelines are well structured and future-proof. The role also involves improving telemetry tooling, specifically tracing support for the architecture.