Inworld AI - Moffett Field, CA

posted 14 days ago

Full-time - Mid Level
Moffett Field, CA

About the position

Inworld is seeking a Staff Cloud DevOps/Site Reliability Engineer to join its Technical Operations team. This role manages the infrastructure, DevOps, and site reliability of Inworld's AI platform, which powers interactive gaming experiences for major industry players. The position requires a strong background in DevOps practices and cloud technologies, with an emphasis on maintaining and improving the platform's reliability and performance.

Responsibilities

  • Maintain and contribute to Infrastructure-as-Code using Terraform.
  • Orchestrate CI/CD pipelines using GitHub Actions, Helm, and ArgoCD.
  • Administer Kubernetes for microservices scalability.
  • Manage cloud infrastructure and services.
  • Measure and monitor service availability, latency, and overall health.
  • Drive incident management and conduct post-mortem analysis.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • 7+ years of experience as a DevOps, Infrastructure, Operations, or Site Reliability Engineer.
  • At least 2 years of experience with Terraform.
  • At least 2 years of experience with Helm.
  • At least 2 years of experience with Kubernetes.
  • Experience with AWS, Azure, or GCP.
  • Experience with CI/CD using modern GitOps tooling.

Nice-to-haves

  • Experience with MLOps (building, orchestrating, and maintaining Machine Learning Pipelines).
  • Familiarity with Prometheus/Grafana for monitoring.
  • Experience with multi-cloud deployments (two or more cloud providers).
  • Knowledge of ArgoCD for continuous delivery.
  • Experience with network management and VPNs.

Benefits

  • Equity in the company.
  • Comprehensive health benefits.
  • Flexible working arrangements.