Monarch 0.3.0 Release Notes
New Features
Kubernetes Job Support
Monarch now supports running distributed training workloads on Kubernetes clusters. The new KubernetesJob API connects to pre-provisioned GPU pods managed by the https://github.com/meta-pytorch/monarch-kubernetes/ repository, enabling seamless multi-node DDP training on Kubernetes.
Key Capabilities:...