v0.6.1
Highlights
Trainer
- support fp16 training (FSDP/Megatron)
Megatron
- support 1f1b_overlap/moe_a2a_overlap
- support for Qwen3VL MoE/dense models
- support Qwen2.5/3vl with context parallel
Rollout
- Use vllm and sglang release image as ci base image, upgrade vllm==0.11.0, upgrade sglang==0.5.5
- Prometheus monitoring
Algorithm
- Rollout Correction: comprehensive overhaul of the rollout correction system with typed configuration, mathematical documentation, and performance optimizations.