TL;DR

Software Engineer Intern (Distributed Systems): Develop and deploy next-generation high-performance distributed systems with an emphasis on time-sensitive applications and nanosecond precision. Focus on designing scalable software, debugging, profiling, and optimizing system performance across multiple layers of the stack.

Location: Onsite in Palo Alto, CA, United States

Company

Clockwork.io pioneers software-driven AI fabrics to increase GPU cluster utilization by delivering cross-stack observability, workload fault tolerance, and performance acceleration.

What you will do

  • Contribute to the design and development of high-performance distributed systems
  • Build reliable, scalable software for time-sensitive applications
  • Collaborate with engineers on core infrastructure, networking, and systems components
  • Debug, profile, and optimize system performance across multiple stack layers
  • Work closely with a small team of researchers and developers on production challenges

Requirements

  • Location: Must be onsite in Palo Alto, CA, United States
  • Currently pursuing a Bachelor’s, Master’s, or PhD in Computer Science, Electrical Engineering, or a related field
  • Proficiency in C, C++, Golang, or Python
  • Strong understanding of data structures, algorithms, concurrency, and parallel programming
  • Experience with kernel programming and networking protocols
  • Strong debugging and problem-solving skills

Nice to have

  • Full-stack development experience
  • Experience with distributed systems, networking, or low-level systems programming

Culture & Benefits

  • Equal opportunity employer committed to diversity and inclusion
  • Work with a team of experienced researchers and engineers
  • Opportunity to work on cutting-edge high-performance infrastructure