TL;DR
Site Reliability Engineer (DevOps): Designing, developing, and testing key aspects of an in-house solution for analysis, simulation, and prototyping of software in support of all SpaceX flight systems, with an emphasis on automation and technical infrastructure. Focus on building high-throughput distributed systems and ensuring high resilience, performance, and scalability.
Location: Must be based in Hawthorne, CA
Salary: $125,000.00 - $175,000.00 per year
Company
SpaceX is actively developing the technologies to make human life on Mars possible.
What you will do
- Develop automation to deploy and manage applications both on-premises and in the cloud.
- Install, manage, scale and optimize Kubernetes and RKE clusters using Ansible and adjacent technologies in production environments.
- Collaborate with software engineers to create highly scalable, operable and maintainable products.
- Engage in and improve the whole lifecycle of services -- from inception and design, through deployment, operation and refinement.
- Build highly resilient, high-performance, scalable, and robust systems.
Requirements
- Bachelor’s degree in computer science, information systems, or an engineering discipline; OR 2+ years of professional experience in software, DevOps, or site reliability engineering in lieu of a degree.
- 1+ year of experience with Linux operating systems.
- Experience in Bash, Python, or other scripting languages.
- Active Top Secret, Top Secret SCI, or DOE Level Q clearance.
- To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C. § 1157, or (iv) Asylee under 8 U.S.C. § 1158, or be eligible to obtain the required authorizations from the U.S. Department of State.
Nice to have
- 1+ years of systems administration, site reliability engineering, or DevOps experience.
- Experience with containerization technologies (e.g. Docker, Kubernetes).
- Strong understanding of Kubernetes, Docker, or similar technologies.
- Strong understanding of message queue technologies such as RabbitMQ or Kafka.
- Experience with dynamic system configuration templating using Jinja, YAML and Helm.
Culture & Benefits
- Eligible for long-term incentives, in the form of company stock, stock options, or long-term cash awards, as well as potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan.
- Access to comprehensive medical, vision, and dental coverage, a 401(k) retirement plan, short- and long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks.
- Accrue 3 weeks of paid vacation and be eligible for 10 or more paid holidays per year.
- Exempt employees are eligible for 5 days of sick leave per year.
