SpaceX - Hawthorne, CA

posted 3 months ago

Full-time
Hawthorne, CA
Transportation Equipment Manufacturing

About the position

As a Site Reliability Engineer at SpaceX, you will play a crucial role in designing, developing, and testing key components of an in-house solution aimed at analysis, simulation, and prototyping of software that supports all SpaceX flight systems. This position offers you full ownership of new features in automation and technical infrastructure, which are essential for supporting scalable, high-performance web applications, as well as a suite of simulation and emulation products. Your work will have a significant impact, as you will collaborate across various engineering groups to build a high-throughput distributed system that is used to develop and demonstrate cutting-edge software and hardware. SpaceX is seeking intelligent and motivated software engineers who thrive on tackling complex challenges, excel in dynamic environments, and are committed to adhering to software best practices. The application software you will be working on is critical to the success of future missions, and you will encounter a plethora of interesting challenges that require innovative and advanced solutions. Your responsibilities will include developing automation for deploying and managing applications both on-premises and in the cloud, managing core infrastructure technologies such as application servers, databases, messaging queues, and storage. You will closely collaborate with software engineers to create highly scalable, operable, and maintainable products, and work with IT and software engineers to develop a test automation suite leveraging DevOps infrastructure. Additionally, you will engage in and improve the entire lifecycle of services, from inception and design through deployment, operation, and refinement.

Responsibilities

  • Develop automation to deploy and manage applications both on-premises and in the cloud
  • Deploy and manage core infrastructure technologies such as application servers, databases, messaging queues, and storage
  • Closely collaborate with software engineers to create highly scalable, operable, and maintainable products
  • Closely collaborate with IT and software engineers to develop a test automation suite leveraging DevOps infrastructure
  • Engage in and improve the whole lifecycle of services -- from inception and design, through deployment, operation, and refinement

Requirements

  • Bachelor's degree in computer science, information systems/IT, or an engineering discipline; OR 2+ years of professional experience in software, DevOps, or site reliability engineering in lieu of a degree
  • 1+ year of experience with Linux operating systems
  • Experience with containerization technologies (i.e. Docker, Kubernetes)
  • Experience with designing and managing solutions in cloud environments such as AWS, Azure or GCP
  • Experience in Bash, Python, and/or other scripting languages
  • Active Secret, Top Secret, Top Secret SCI, OR ability and willingness to obtain a Top Secret clearance

Nice-to-haves

  • 1+ years of systems administration, site reliability engineering, or DevOps experience
  • 1+ years of experience with Python and Python-based development frameworks
  • Strong understanding of Kubernetes, Docker, or similar technologies
  • Strong understanding of message queue technologies such as RabbitMQ or Kafka
  • Strong understanding of virtualization and hypervisor technologies
  • Understanding of databases and performance tuning
  • Experience with identity management and authentication protocols
  • Focus on performance bottlenecks and performance improvement techniques
  • Excellent communications skills with the ability to communicate with customers, peers, management etc. in both formal and informal situations
  • Ability to quickly learn new tools and frameworks

Benefits

  • life insurance
  • parental leave
  • paid holidays
  • sick time
  • 401(k)
  • retirement plan
  • stock options
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service