Tesla - Fremont, CA

posted 10 days ago

Full-time - Mid Level
Fremont, CA
Transportation Equipment Manufacturing

About the position

Tesla's Platform Engineering team is seeking a Site Reliability Engineer to build and maintain Kubernetes clusters using infrastructure-as-code tools. This role involves supporting application teams and managing a diverse infrastructure that includes on-premise VMs, bare metal hosts, and public clouds like AWS. The ideal candidate will have strong Linux expertise, software development skills, and experience with Kubernetes in production environments. This position is critical for running production workloads and setting standards across engineering teams at Tesla.

Responsibilities

  • Hands-on with developers to deploy applications and provide support.
  • Building new features to improve platform stability and updates.
  • Manage Kubernetes clusters on-prem and in the cloud to support growing workloads.
  • Participate in architecture design and troubleshoot live applications with product teams.
  • Participate in a 24x7 on-call rotation (12 hours day shift once a week and a weekend shift once every 6-8 weeks).
  • Influence architectural decisions focusing on security, scalability, and high performance.
  • Setup and maintain monitoring, metrics, and reporting systems for observability and alerting.
  • Author technical documentation for workflows, processes, and best practices.

Requirements

  • Experience managing web-scale infrastructure in a production *nix environment.
  • Ability to prioritize tasks and work independently with an analytical mind and a bias for action.
  • Advanced or expert-level Linux administration and performance tuning skills.
  • Bachelor's Degree in Computer Science, Computer Engineering, or equivalent experience.
  • Advanced experience with configuration management systems such as Ansible, Terraform, or Puppet.
  • Demonstrable knowledge of Linux operating system internals, networking stack, filesystems, resource scheduling, and process management.
  • Exposure to AWS or other cloud infrastructure providers.
  • Experience managing container-based workloads using Kubernetes or other orchestration software in production.
  • Proficiency in a high-level language like Python, Go, Ruby, and/or Java.

Benefits

  • Aetna PPO and HSA plans with $0 payroll deduction options.
  • Family-building, fertility, adoption, and surrogacy benefits.
  • Dental and vision plans with $0 paycheck contribution options.
  • Company Paid HSA Contribution when enrolled in the High Deductible Aetna medical plan with HSA.
  • Healthcare and Dependent Care Flexible Spending Accounts (FSA).
  • LGBTQ+ care concierge services.
  • 401(k) with employer match, Employee Stock Purchase Plans, and other financial benefits.
  • Company paid Basic Life, AD&D, short-term and long-term disability insurance.
  • Employee Assistance Program.
  • Sick and Vacation time (Flex time for salary positions), and Paid Holidays.
  • Back-up childcare and parenting support resources.
  • Voluntary benefits including critical illness, hospital indemnity, accident insurance, theft & legal services, and pet insurance.
  • Weight Loss and Tobacco Cessation Programs.
  • Tesla Babies program.
  • Commuter benefits.
  • Employee discounts and perks program.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service