This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

Kognitos - San Jose, CA

posted 4 days ago

Hybrid - San Jose, CA
Publishing Industries

About the position

Kognitos is at the forefront of revolutionizing the trillion-dollar hyper-automation market. Our mission is to redefine how software is built and maintained by leveraging cutting-edge multi-agent automation platforms. We are pioneering advancements in agentic workflows, enabling machines to reason, plan, and execute tasks in a deterministic fashion. We're looking for a Developer Productivity Engineer with an SRE background to help us streamline and optimize our development processes, making a significant impact on both developer efficiency and overall system reliability. In this hybrid role, you will work at the intersection of Developer Productivity and Site Reliability Engineering (SRE). You'll be responsible for improving developer workflows, building tools to automate repetitive tasks, and ensuring our systems are robust, reliable, and performant. Your role is crucial in helping developers at Kognitos ship features faster while maintaining high system availability.

Responsibilities

  • Design, build, and maintain internal tools and automation frameworks that enhance developer efficiency and reduce toil.
  • Identify bottlenecks in the development workflow and implement solutions to streamline processes.
  • Work closely with engineering teams to understand their needs and provide effective tooling, documentation, and process improvements.
  • Implement SRE best practices, including monitoring, alerting, and capacity planning, to ensure a stable, high-performance infrastructure.
  • Automate deployment pipelines, conduct system health checks, and reduce manual intervention for incident management.
  • Collaborate with cross-functional teams to develop and enforce SLAs, SLOs, and SLIs, ensuring a resilient infrastructure.
  • Advocate for testing practices that improve code quality and maintainability, including CI/CD pipelines, code reviews, and static code analysis.
  • Optimize resource utilization and enhance scalability through proactive performance tuning.
  • Serve as a bridge between development and operations teams, promoting a DevOps culture.
  • Provide mentorship on development best practices and DevOps tools to support our engineering culture of continuous improvement.

Requirements

  • Proven experience in Developer Productivity Engineering or SRE roles.
  • Proficiency with cloud platforms (AWS, Azure, GCP) and container orchestration (Kubernetes, Docker).
  • Familiarity with Infrastructure as Code (IaC) tools like Terraform, Ansible, or CloudFormation.
  • Strong scripting and automation skills (Python, Bash, or similar).
  • Strong problem-solving abilities and attention to detail.
  • Ability to work collaboratively in cross-functional teams, communicating effectively with technical and non-technical stakeholders.
  • Adaptability and willingness to learn new tools and technologies.

Nice-to-haves

  • Experience with observability tools (Prometheus, Grafana, Datadog) for monitoring and alerting.
  • Familiarity with CI/CD tools and best practices, especially GitHub Actions, Jenkins, or equivalent.
  • Previous work in a startup environment or on rapid-growth engineering teams is a plus.
Job Description Matching

Match and compare your resume to any job description

Start Matching
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service