Confluent - Providence, RI

posted 2 months ago

Full-time - Mid Level
Remote - Providence, RI
Publishing Industries

About the position

At Confluent, we are on a mission to empower organizations to harness the full potential of continuously flowing data, enabling them to innovate and thrive in the modern digital landscape. As a Federal Site Reliability Engineer, you will play a pivotal role in this mission by working closely with key public sector agencies. Your passion for data will help transform events into actionable outcomes, facilitating the development of intelligent, real-time applications that allow teams and systems to respond to data instantaneously. In this role, you will be at the forefront of delivering highly performant and reliable systems through Confluent Cloud, a comprehensive end-to-end streaming experience offered as a Software as a Service (SaaS) model. You will partner with our Cloud Architecture and Engineering teams to enhance the operational resiliency of Confluent Cloud systems utilized by federal agencies. Your collaboration will extend across various teams to verify and deploy production changes, ensuring that our systems meet the stringent compliance requirements of FedRAMP data handling. You will also maintain critical monitoring systems for triage and escalations in the federal space, continuously improving automated recovery processes. By adhering to established incident and change management processes, you will help drive ongoing improvements that enhance our service delivery and operational excellence.

Responsibilities

  • Partner with Cloud Architecture and Engineering teams to enhance operational resiliency of Confluent Cloud systems for federal agencies.
  • Collaborate across teams to verify and deploy production changes to Confluent Cloud systems and infrastructure.
  • Engage with peer engineering teams during incidents using an 'escort model' to ensure compliance with FedRAMP data handling requirements.
  • Maintain critical monitoring systems for triage and escalations in the federal space and improve automated recovery processes.
  • Adhere to established incident and change management processes and drive continuous improvements.

Requirements

  • U.S. Citizenship is required to comply with U.S. federal government regulations.
  • 6+ years of relevant experience in a related field.
  • Expertise in Cloud Native technologies with experience operating production services in the cloud.
  • Strong fundamentals of Distributed Systems and their design.
  • Deep knowledge of Kubernetes and containerization.
  • Experience with telemetry tooling to monitor production systems.
  • Confidence in problem-solving and troubleshooting critical services.
  • Proficiency with scripting and automation (e.g., Go, Java, Python, Bash).
  • Working knowledge of infrastructure as code (e.g., Terraform, CloudFormation, AWS CDK, Pulumi).
  • Exceptional teamwork and collaboration skills, with the ability to act critically with minimal supervision in a remote-first environment.
  • Experience with a rotating on-call schedule to provide 24/7 support.
  • BS Degree in Computer Science, Engineering, or equivalent experience.

Benefits

  • Competitive pay and benefits in line with industry standards.
  • Annual estimated salary of $145,920 - $171,440 USD.
  • Annual bonus and competitive equity package.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service