TL;DR
Senior Software Engineer (AI Compute): Managing and scaling Airbnb’s Kubernetes-based GPU infrastructure to power machine learning initiatives, with an emphasis on developer experience, operational efficiency, and system reliability. Focus on designing platform architecture, optimizing GPU fleet performance, and executing multi-year strategies for AI compute capabilities.
Location: Remote eligible; must reside in a US state where Airbnb has a registered entity.
Salary: $191,000–$225,000 USD
Company
Airbnb is a global platform that connects millions of hosts and guests through unique stays and experiences.
What you will do
- Serve as technical lead for the lifecycle management of the Kubernetes-based GPU platform.
- Enhance ML engineering workflows to improve developer productivity and operational efficiency.
- Drive reliability, scalability, and security across the AI compute fleet.
- Execute the multi-year strategy for AI infrastructure development and maintenance.
- Coach and influence a distributed team of engineers on high-impact projects.
- Align cross-functional teams on platform goals and deliverables.
Requirements
- Must reside in a US state with a registered Airbnb entity.
- 5+ years of relevant experience in infrastructure engineering.
- 2+ years of experience with a public cloud provider (AWS, GCP, or Azure).
- Proven experience with Kubernetes.
- Experience planning and executing large projects across multiple teams.
- BS, MS, or Ph.D. in Computer Science or equivalent experience.
Nice to have
- Strong background in ML Infrastructure, including LLM fundamentals, tuning, and optimization.
Culture & Benefits
- Eligibility for bonus, equity, and company benefits.
- Employee Travel Credits provided.
- Commitment to an inclusive and diverse workplace culture.
- Occasional opportunities for in-person collaboration at Airbnb offices and offsites.
