Confluent - Augusta, ME

posted 2 months ago

Full-time - Mid Level
Remote - Augusta, ME
Publishing Industries

About the position

At Confluent, we are on a mission to revolutionize the way organizations utilize data through our innovative data streaming technology. As a Federal Site Reliability Engineer, you will play a crucial role in enabling public sector agencies to leverage real-time data for impactful decision-making. This position offers the unique opportunity to work closely with key federal agencies, ensuring that their systems are not only highly performant but also reliable and compliant with stringent regulations. You will be at the forefront of delivering a complete end-to-end streaming experience via Confluent Cloud, a Software as a Service (SaaS) model that empowers agencies to solve real-time problems effectively. In this role, you will partner with our Cloud Architecture and Engineering teams to enhance the operational resiliency of Confluent Cloud systems. Your collaboration will extend across various teams to verify and deploy production changes, ensuring that our systems meet the high standards required by federal agencies. You will engage actively during incidents, utilizing an “escort model” to maintain compliance with FedRAMP data handling requirements. Your responsibilities will also include maintaining critical monitoring systems for triage and escalations, as well as driving continuous improvements in automated recovery processes. This position is ideal for someone who thrives in a dynamic environment and is passionate about using data to drive outcomes.

Responsibilities

  • Partner with Cloud Architecture and Engineering teams to enhance operational resiliency of Confluent Cloud systems.
  • Collaborate across teams to verify and deploy production changes to Confluent Cloud systems and infrastructure.
  • Engage during incidents using an 'escort model' to ensure compliance with FedRAMP data handling requirements.
  • Maintain critical monitoring for triage and escalations in the federal space and improve automated recovery processes.
  • Adhere to established incident and change processes and drive continuous improvements.

Requirements

  • U.S. Citizenship is required to comply with U.S. federal government regulations.
  • 6+ years of relevant experience in a related field.
  • Expertise in Cloud Native technologies with experience operating production services in the cloud.
  • Strong fundamentals of Distributed Systems and their design.
  • Deep knowledge of Kubernetes and containerization.
  • Experience with telemetry tooling to monitor production systems.
  • Confidence in problem-solving and troubleshooting critical services.
  • Proficiency with scripting and automation (e.g., Go, Java, Python, Bash).
  • Working knowledge of infrastructure as code (e.g., Terraform, Cloudformation, AWS CDK, Pulumi).
  • Exceptional teamwork and collaboration skills, with the ability to act critically with minimal supervision in a remote-first environment.
  • Experience with a rotating on-call schedule to provide 24/7 support.
  • BS Degree in Computer Science, Engineering, or equivalent experience.

Benefits

  • Competitive pay and benefits in line with industry standards.
  • Annual estimated salary of $145,920 - $171,440 USD.
  • Annual bonus and competitive equity package.
  • Wide range of employee benefits.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service