TL;DR

AI/ML Specialist Solution Architect (AI): Supporting the design and deployment of AI/ML solutions on GPU cloud infrastructure, with an emphasis on distributed training and inference optimization. Focus on helping customers adopt large-scale AI workloads while developing expertise in modern ML frameworks and cloud systems.

Location: Must be based in the USA

Company

Nebius is a global cloud platform provider focused on serving the AI economy with specialized, high-performance infrastructure.

What you will do

  • Assist in the design and documentation of scalable AI and ML cloud solutions.
  • Learn to operate distributed training and inference workloads on multi-GPU platforms.
  • Create technical content including presentations, tutorials, and system demos.
  • Help investigate, troubleshoot, and optimize customer AI workloads.
  • Collaborate with engineering and product teams to support customer needs.

Requirements

  • Current university student, recent graduate, or early-career specialist.
  • Must be authorized to work in the USA.
  • Solid understanding of machine learning concepts and workflows.
  • Familiarity with at least one ML framework such as PyTorch, TensorFlow, or JAX.
  • Proficiency in Python programming.
  • Strong communication skills and interest in cloud infrastructure.

Nice to have

  • Exposure to container technologies like Docker and Kubernetes.
  • Familiarity with Git or basic DevOps workflows.
  • Project experience with distributed systems or MLOps.

Culture & Benefits

  • Mentorship from experienced AI, ML, and cloud infrastructure professionals.
  • Hands-on experience with real customer workloads in production environments.
  • Remote work flexibility.
  • Supportive, collaborative team environment.
  • Possible consideration for a full-time role after the program.