Scaling ML Training on Kubernetes with JobSet

JobSet is a Kubernetes-native API for managing distributed ML and HPC jobs with support for multi-role pods, topology-aware placement, and scaling.

  • Kubernetes
Abhimanyu Saharan
Abhimanyu Saharan

Filter by category