Qualcommposted about 1 month ago
$134,800 - $202,200/Yr
Full-time • Mid Level
Hybrid • San Diego, CA
Computer and Electronic Product Manufacturing

About the position

We're seeking a talented DevOps Engineer to drive the deployment of large-scale ML applications and models. You'll leverage your expertise in Docker, Terraform, K3s, and Kubernetes (K8s) to build CI/CD pipelines, manage containerized workloads, and create deployment scripts tailored to our AI accelerator chips. This role demands a solid understanding of LLMs, AI agents, and programming in both compiled and high-level languages.

Responsibilities

  • Deploy and scale ML applications and models using Kubernetes (K8s) and lightweight K3s clusters, optimizing for our AI accelerator chips.
  • Containerize ML models and workloads with Docker, ensuring seamless integration with our hardware.
  • Use Terraform to define and manage infrastructure as code for consistent, repeatable deployments.
  • Build and maintain CI/CD pipelines to automate the release of ML models and applications.
  • Work with Kubernetes APIs to automate orchestration, scaling, and management of containerized environments.
  • Develop deployment scripts and tools to empower users to adopt our AI solutions effortlessly.
  • Collaborate with ML teams to troubleshoot and enhance production deployments.
  • Apply your knowledge of LLMs and AI agents to optimize deployment workflows.
  • Write efficient code in a compiled language (e.g., C, C++, Rust) and a high-level language (e.g., Python, Go) as needed.
  • Ensure system reliability, security, and performance for mission-critical ML workloads.

Requirements

  • Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 4+ years of Software Engineering or related work experience.
  • OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software Engineering or related work experience.
  • OR PhD in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience.
  • 2+ years of work experience with Programming Language such as C, C++, Java, Python, etc.
  • Proven experience deploying large-scale ML applications in production environments.
  • Strong proficiency with Docker for containerization and Kubernetes (K8s) for orchestration.
  • Hands-on experience with K3s for lightweight, edge-friendly ML deployments.
  • Expertise in Terraform for infrastructure provisioning and management.
  • Demonstrated ability to design and implement CI/CD pipelines (e.g., Jenkins, GitLab CI, GitHub Actions).
  • Familiarity with large language models (LLMs), AI agents, and their operational needs.
  • Proficiency in a compiled language (e.g., C, C++, Rust) and a high-level language (e.g., Python, Go).
  • Comfortable with automation scripting (e.g., Bash, Python) and system monitoring tools.
  • Understanding of hardware-accelerated computing (e.g., GPUs, custom AI chips) is a plus.
  • Strong problem-solving skills and a passion for enabling user success.

Nice-to-haves

  • Understanding of ML frameworks like TensorFlow or PyTorch.
  • Knowledge of cloud platforms (e.g., AWS, GCP, Azure) and hybrid environments.
  • Background in optimizing workloads for specialized AI hardware.

Benefits

  • $134,800.00 - $202,200.00 salary range.
  • Competitive annual discretionary bonus program.
  • Opportunity for annual RSU grants.
  • Highly competitive benefits package designed to support your success at work, at home, and at play.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service