TL;DR
Senior DevOps / SRE Specialist: Maintaining Client's Cyber Security SaaS cloud platform with an accent on optimal performance and smart cloud spending. Focus on infrastructure-as-code, monitoring and observability systems implementation and tuning, on-call incident response and post-mortem reviews to continuously improve system resilience.
Location: Remote
Company
Ciklum is a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges.
What you will do
- Develop, improve, and maintain Client's Cyber Security SaaS cloud platform
- Lead problem-solving efforts for the entire technology stack in collaboration with other teams in the R&D
- Work cross-functionally on cost optimization strategies to ensure infrastructure is both efficient and budget-conscious
- Practice infrastructure as a code (IaC) and GitOps using technologies like Terraform an ArgoCD
- Collaborate with Client's development and research groups to constantly improve our platform and infrastructure
- Develop and evolve our tooling, logging, monitoring, and alerting mechanisms to increase observability and transparency
Requirements
- Experience with containerization and orchestration technologies (e.g., Docker, Kubernetes)
- Excellent problem-solving skills and ability to think critically about complex technical challenges and optimize production systems
- Experience in managing and troubleshooting Linux systems
- Experience with observability systems such as Datadog/Splunk/New Relic/Grafana, or similar
- Experience in Shell scripting and/or high-level Programming like Python and Go
- Experience working with cloud environments like GCP, Linode, AWS, and Azure
- Excellent verbal and written English communication and presentation skills
Nice to have
- Experience with cloud cost monitoring and optimization tools (e.g., GCP Cost Explorer, AWS Cost Explorer, Kubecost, etc.)
Culture & Benefits
- Work alongside top professionals in a friendly, open-door environment
- Take on large-scale projects with a global impact and expand your expertise
- Boost your skills with internal events (meetups, conferences, workshops), Udemy access, language courses, and company-paid certifications
- Explore diverse domains through internal mobility, finding the best fit to gain hands-on experience with cutting-edge technologies
- Enjoy radical flexibility – work remotely or from an office, your choice
- We’ve got you covered with company-paid medical insurance, mental health support, and financial & legal consultations
