Insight Global - San Diego, CA

posted 2 months ago

Full-time - Mid Level
San Diego, CA
Administrative and Support Services

About the position

We are looking for a Site Reliability Engineer (SRE) who will play a crucial role in maintaining and enhancing our large-scale mission-critical environments, ensuring zero downtime. The SRE team is composed of professionals with backgrounds in Database Administration (DBA), Site Reliability Engineering (SRE), and Database Reliability Engineering (DBRE). Our primary objective is to provide highly available data services at scale, which requires a commitment to building an extremely reliable, performant, and secure database infrastructure. This is achieved through the skillful use of automation and innovative solutions. As a Site Reliability Engineer, you will be responsible for designing and implementing new architectures and scalability solutions to meet the ever-growing business and data processing needs. You will work closely with cross-functional teams to ensure that our systems are robust and can handle high traffic while maintaining performance and reliability. Your expertise will be essential in troubleshooting and resolving issues in real-time, ensuring that our applications remain operational and efficient. We are committed to fostering a diverse and inclusive work environment where all employees can bring their authentic selves to work. We believe that diversity drives innovation and success, and we are an equal opportunity employer. We encourage qualified candidates from all backgrounds to apply, and we provide reasonable accommodations for individuals with disabilities during the application and recruitment process.

Responsibilities

  • Design and implement scalable architectures for mission-critical applications.
  • Ensure high availability and performance of data services.
  • Automate processes to enhance reliability and efficiency.
  • Monitor system performance and troubleshoot issues in real-time.
  • Collaborate with cross-functional teams to meet business and data processing needs.
  • Utilize observability tools to maintain system health and performance.

Requirements

  • Bachelor's degree or foreign equivalent in Computer Science, Information Systems, or a related field.
  • 2 years of experience in the job offered or as a Computer Systems Engineer, Software Engineer, or related job titles.
  • Experience supporting mission-critical, real-time, high-traffic applications in cloud environments.
  • Knowledge of cloud systems and continuous integration/build systems.
  • Proficiency in Java, SQL, and NoSQL databases.
  • Experience with observability tools such as Grafana, Prometheus, and Zabbix.
  • Proficiency in scripting/programming languages such as Python or GoLang.
  • Familiarity with one or more open-source technologies like Elasticsearch, Kafka, or Redis.
  • Experience with container technologies such as Docker, Kubernetes, or Mesos.

Nice-to-haves

  • Mandarin speaking proficiency.
Ā© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service