SpaceX - Hawthorne, CA

posted 23 days ago

Full-time - Mid Level
Hawthorne, CA
Transportation Equipment Manufacturing

About the position

The Sr. Site Reliability Engineer for Data at SpaceX plays a crucial role in developing and maintaining mission-critical applications that support the company's ambitious goals, including enabling human life on Mars and expanding the Starlink network. This position involves full ownership of complex problems, collaborating with engineers across various programs to create scalable and maintainable systems that enhance the efficiency of launch vehicle production and flight operations.

Responsibilities

  • Upgrade existing distributed systems to become sharded and geo-redundant in multiple data centers
  • Advance existing deployment, monitoring, and alerting infrastructure to support a multi-region environment
  • Manage petabyte scale bare metal compute clusters
  • Closely collaborate with engineers across all programs to create highly operable, scalable, and maintainable products
  • Engage throughout the whole software development lifecycle of services -- from inception to design, deployment, operation, and iterative refinement
  • Focus on performance bottlenecks and performance improvement techniques

Requirements

  • Bachelor's degree in computer science, engineering, math, or scientific discipline and 5 years of software development experience OR 7+ years of professional experience building software with site reliability or DevOps in lieu of a degree
  • Experience with Linux operating systems

Nice-to-haves

  • 5+ years of rigorous experience with site reliability or DevOps
  • Experience with Kubernetes and Istio for on-premise deployment
  • Experience within-stream, data processing and analytics using open source platforms such as Apache Kafka, Spark, HBase, HDFS, Flink
  • Experience troubleshooting hardware and network-layer issues
  • Programming experience in Python, C#, Java, Scala, or similar languages
  • Good understanding of version control, testing, continuous integration, build, deployment and monitoring

Benefits

  • Comprehensive medical, vision, and dental coverage
  • 401(k) retirement plan
  • Short & long-term disability insurance
  • Life insurance
  • Paid parental leave
  • Various discounts and perks
  • 3 weeks of paid vacation
  • 10 or more paid holidays per year
  • 5 days of sick leave per year
  • Long-term incentives in the form of company stock, stock options, or long-term cash awards
  • Potential discretionary bonuses
  • Employee Stock Purchase Plan
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service