TL;DR
Infrastructure Team Lead (SRE/DevOps): Lead a distributed engineering team that builds and scales highly available infrastructure for a global revenue OS, with an emphasis on system reliability, uptime metrics, and performance. The role focuses on designing geo-distributed disaster recovery plans, scaling storage for millions of concurrent users, and mentoring engineers in efficient delivery practices.
Company
Adapty is a fast-growing revenue OS for mobile app businesses that simplifies subscription management, analytics, and paywall optimization.
What you will do
- Lead and coach a distributed infrastructure team to ensure high delivery standards and uptime.
- Improve uptime metrics, including MTBF and MTTR, using symptom-based monitoring.
- Design and maintain scalable infrastructure to support millions of concurrent users.
- Make key architectural decisions balancing short-term product needs with long-term vision.
- Educate and guide the engineering team on reliable delivery and infrastructure practices.
Requirements
- 8+ years of experience building engineering solutions with Kubernetes, Python, PostgreSQL, ClickHouse, and Kafka.
- Proven programming ability in a high-level language such as Python, Go, C/C++, or JavaScript.
- Track record of making key architectural decisions in high-load SaaS environments.
- Demonstrated experience resolving production bottlenecks and managing critical incidents.
- Strong system-level mindset with a focus on root-cause analysis and collaborative decision-making.
Culture & Benefits
- Flexible remote work policy.
- Support for professional development through English lessons and other resources.
- Comprehensive equipment coverage and sports reimbursements.
- Environment focused on direct communication, high ownership, and minimal bureaucracy.
