TL;DR

SRE Manager (System Engineering): Lead and manage the HUB SRE team responsible for reliability, availability, and performance of critical hybrid cloud and bare-metal services with an accent on SRE principles, incident management, and capacity planning. Focus on building observability, reducing toil, evolving CI/CD pipelines, and mentoring engineers in SRE practices.

Company

TradingView is the world’s largest financial analysis platform with over 100 million users globally, providing advanced charting and market data tools trusted by major companies.

What you will do

  • Lead and manage the HUB SRE team, fostering a culture based on SRE principles including SLOs, error budgets, and toil reduction.
  • Define and implement SLOs/SLIs/error budgets for critical services to make reliability measurable and actionable.
  • Drive incident management processes including on-call rotations, blameless post-mortems, and structured incident response.
  • Build and improve observability tools such as metrics, alerting, distributed tracing, and dashboards.
  • Drive capacity planning and performance engineering to ensure service scalability and prevent outages.
  • Collaborate with backend teams to review architectures and advocate for reliability improvements.

Requirements

  • Proven experience as an Engineering Manager, SRE Lead, or Reliability Engineering Lead managing engineering teams.
  • Deep understanding of SRE discipline including SLOs, error budgets, toil classification, capacity planning, and incident management.
  • Strong technical background in backend systems, Linux, networking, and distributed systems.
  • Experience with hybrid infrastructure: cloud and bare-metal servers.
  • Experience building observability and optimizing CI/CD pipelines.
  • Excellent communication and people management skills.

Nice to have

  • Experience with high-load systems and strict latency requirements.
  • Familiarity with chaos engineering and proactive reliability testing.
  • Knowledge of Infrastructure-as-Code tools like Terraform and Ansible.

Culture & Benefits

  • Flexible working hours and hybrid work format.
  • Well-equipped offices for focused and collaborative work.
  • Global distributed team of 500+ professionals.
  • Learning, mentorship, and career growth opportunities.
  • Relocation support and private health insurance.
  • Performance-based bonuses and TradingView Premium access.
  • Regular team events and company-wide meetups.