TL;DR

SRE Engineer (Monitoring Tools): Maintaining and optimizing high-scale observability infrastructure with an accent on Linux administration, system reliability, and automation. Focus on building distributed monitoring solutions, incident management, and scaling custom observability platforms in a high-concurrency SaaS environment.

Company

JettyCloud is a product-driven organization focused on developing mission-critical observability solutions and monitoring platforms.

What you will do

  • Maintain and support the availability and health of internal monitoring and alerting infrastructure.
  • Lead incident resolution efforts and participate in a sustainable on-call rotation.
  • Evolve and improve the custom monitoring stack to meet evolving business requirements.
  • Collaborate with development teams to integrate observability solutions into the software development lifecycle.
  • Manage system capacity to handle growth in a high-concurrency SaaS environment.
  • Contribute to the codebase in Go or Python to automate operational toil and extend system integrations.

Requirements

  • 4+ years of experience as an SRE or Systems Engineer in production environments.
  • Strong Linux administration and performance tuning proficiency.
  • Experience with Go or Python for system interaction and automation.
  • Understanding of SaaS telemetry, monitoring domains, and alerting theory.
  • Experience operating cloud platforms like AWS or GCP.
  • B.S. in Computer Engineering, Computer Science, or a related field.

Nice to have

  • Experience operating systems in large-scale, heterogeneous environments.
  • Hands-on experience with ClickHouse, VictoriaMetrics, or similar TSDBs.
  • Proficiency in IaC tools like Terraform and Ansible.
  • Experience with Kubernetes and container orchestration.

Culture & Benefits

  • Professional team environment focused on cutting-edge technologies.
  • Opportunities for professional career growth and self-realization.
  • Comprehensive health and life insurance package.
  • Employee assistance program.
  • 25 days of vacation per year.