posted 2 months ago

Full-time - Mid Level
Remote - San Diego, CA

About the position

As a Staff Site Reliability Engineer at Mango Technologies, Inc. dba ClickUp, you will play a crucial role in ensuring the reliability, availability, and performance of our services. This position is fully remote, allowing you to work from anywhere in the continental U.S., while reporting to our headquarters in San Diego, CA. You will be part of a dynamic team that is dedicated to maintaining and improving our infrastructure, ensuring that our systems are robust and scalable to meet the demands of our growing user base.

In this role, you will be responsible for designing and implementing solutions that enhance system reliability and performance. You will work closely with development teams to ensure that new features are designed with reliability in mind, and you will be involved in incident response and post-mortem analysis to continuously improve our systems. Your expertise will be vital in automating processes and improving operational efficiency, allowing us to deliver a seamless experience to our users.

We are looking for someone who is not only technically proficient but also embodies our company values of hard work, growth, and innovation. You will have the opportunity to contribute to a culture that encourages creativity and collaboration, and where your ideas can have a significant impact on our success. If you are passionate about site reliability engineering and are eager to take on new challenges, we would love to hear from you!

Responsibilities

  • Design and implement solutions to enhance system reliability and performance.
  • Collaborate with development teams to ensure reliability is considered in new features.
  • Participate in incident response and post-mortem analysis to improve systems.
  • Automate processes to improve operational efficiency.
  • Monitor system performance and troubleshoot issues as they arise.
  • Develop and maintain documentation related to system architecture and processes.

Requirements

  • Proven experience in site reliability engineering or a related field.
  • Strong knowledge of cloud infrastructure and services.
  • Experience with automation tools and scripting languages.
  • Familiarity with monitoring and logging tools.
  • Ability to work collaboratively in a remote team environment.
  • Excellent problem-solving skills and attention to detail.

Nice-to-haves

  • Experience with container orchestration tools like Kubernetes.
  • Knowledge of CI/CD pipelines and DevOps practices.
  • Familiarity with database management and optimization.
  • Experience in a fast-paced startup environment.

Benefits

  • Equity
  • 401(k) with up to 2% match
  • ClickUp swag
  • Teammate recognition award
  • Professional development program
  • Health insurance
  • Dental insurance
  • Paid parental leave
  • Flexible paid time off
  • Sabbatical program
  • Wellness stipend