Instabase - San Francisco, CA

posted 2 months ago

Full-time - Mid Level
San Francisco, CA
Professional, Scientific, and Technical Services

About the position

At Instabase, we are dedicated to democratizing access to cutting-edge AI innovation, enabling organizations to tackle previously unsolvable unstructured data challenges. Our Site Reliability and Platform Engineering team plays a pivotal role in building and maintaining scalable, distributed systems that are robust and reliable. This position is designed for individuals with a passion for integrating software engineering and systems engineering to ensure our platforms are ready to scale and meet the demands of our diverse clientele, which includes some of the largest organizations globally. As a Site Reliability Engineer, you will take ownership of features, designing, implementing, and managing them independently. You will collaborate with cross-functional teams to contribute to and execute technical strategies that balance immediate needs with long-term objectives. Your responsibilities will include managing and optimizing cloud infrastructure, ensuring reliability and efficiency in deployment automation, and supporting production systems to maintain uptime through proactive monitoring and issue resolution. In addition to these core responsibilities, you will assist in vulnerability management to uphold system security and integrity, contribute to the development and maintenance of CI/CD pipelines, and implement tools that enhance developer productivity. You will also play a key role in release management processes, ensuring smooth and reliable software releases, and you will have the opportunity to mentor new engineers, providing guidance as they integrate into the team. This role is ideal for someone who thrives in a collaborative environment and is eager to contribute to the success of our innovative platform.

Responsibilities

  • Design, implement, and manage features independently, demonstrating strong ownership and accountability for the systems you work on.
  • Collaborate with cross-functional teams to contribute to and execute technical strategies, balancing short-term needs with long-term goals.
  • Manage and optimize cloud infrastructure and deployment automation, ensuring reliability and efficiency.
  • Support and enhance production systems, maintaining uptime and reliability through proactive monitoring and issue resolution.
  • Assist in managing and addressing vulnerabilities to maintain system security and integrity.
  • Contribute to the development and maintenance of CI/CD pipelines and build systems to streamline development processes.
  • Implement and refine tools that improve developer productivity and streamline workflows.
  • Support and improve release management processes, ensuring smooth and reliable software releases.
  • Help onboard and mentor new engineers, providing guidance and support as they integrate into the team.

Requirements

  • 2+ years of experience in Site Reliability Engineering, Software Engineering, or Production Engineering, with a demonstrated ability to design and complete features independently.
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
  • Strong understanding of the systems and products you work on, including their upstream and downstream dependencies.
  • Experience with major cloud providers such as AWS or Azure.
  • Familiarity with containerization technologies like Docker.
  • Experience with container orchestration platforms such as Kubernetes.
  • Experience supporting and improving release management processes.
  • A proactive, problem-solving mindset with a passion for automation and system reliability.
  • Ability to work effectively with other teams (PM, design, SE, etc.) and contribute to achieving higher-level goals.

Benefits

  • Annual bonus
  • Equity
  • Comprehensive benefits package
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service