Highlights

Model Engine

As noted in https://github.com/volcengine/verl/issues/3624, model engine is a service that provides APIs for manipulation of a parallel and distributed model using single controller. This release provides a prototype for such idea using FSDP + ulysses backend and megatron core backend. The implementation is under https://github.com/volcengine/verl/tree/main/verl/workers/engine. Currently, we only implement SFT trainer using model engine. In the following releases, we will start to implement RL trainer using model engine. Please refer to https://verl.readthedocs.io/en/latest/workers/model_engine.html for the design and instructions to add more model engine backends.

Rollout Server

As agentic reinforcement learning emerges as a predominant research area, verl rollout is transitioning from SPMD mode to server mode, which is more efficient for multi-turn rollout and tool calling. In version 0.6, we made several major changes to rollout servers:

SGLang: https://github.com/volcengine/verl/pull/3090 completely separates the SGLang process from the trainer process in SPMD mode and introduces a server adapter to synchronize weights between the trainer and SGLang server. Furthermore, https://github.com/volcengine/verl/pull/3456 migrates SGLang to native server mode, enabling full-fledged features and optimizations for online serving.
vLLM: While the vLLM model_runner remains within the trainer process, https://github.com/volcengine/verl/pull/3456 also transitions vLLM to native server mode. We may explore completely separating the vLLM process from the trainer process in future releases.

By switching to native server mode, https://github.com/volcengine/verl/pull/3530 adds DP+EP support for large MoE models.

To improve extensibility, https://github.com/volcengine/verl/pull/3285 refactors the BaseRollout interface and deprecates all sharding managers. This refactor ensures the training engine remains agnostic of the inference engine during weight synchronization, making it easier to integrate new inference engines (e.g., TensorRT-LLM) without modifying the training engine.

Newly Supported Models

Qwen3 VL
GPT OSS

Algorithm

GSPO
Token-level TIS: https://github.com/volcengine/verl/pull/2953 introduces token-level importance sampling to mitigate the gap between rollout and training.
Sequence-level TIS: https://github.com/volcengine/verl/pull/3694 add more comprehensive metrics to monitor distribution mismatch between rollout and training, and introduces sequence-level importance sampling.

Recipe

Some awesome recipes have been added in v0.6:

Breaking changes and deprecations

nD Dispatch method

Previously, we implemented a set of predefined dispatch method including ONE_TO_ALL, DP_COMPUTE_DATA_PROTO, MEGATRON_COMPUTE_DATA_PROTO, etc,. DP_COMPUTE_DATA_PROTO and MEGATRON_COMPUTE_DATA_PROTO are strongly correlated to the underlying distributed strategies. Writing a separate dispatch method for each strategy is not scalable. In this release, we propose a new API to to unify all distributed strategies. The general steps are

Define device meshes or process groups
register dispatch and collect info by calling _register_dispatch_collect_info inside the worker
Add registration for methods using @register(dispatch_mode=make_nd_compute_dataproto_dispatch_fn(mesh_name=mesh_name))

Please refer to https://github.com/volcengine/verl/blob/main/tests/single_controller/test_device_mesh_register.py as an example.

ShardingManager

ShardingManager is deprecated and will be removed in next release.

Importance bug fixes

Fix hang issue when mixing text and images data training in VLMs (e.g., Qwen VL)
Fix DataProto getstate bug

What's Changed

[cfg] refactor: add ActorConfig, EngineConfig, and ActorWorker unit test, refactor validation code by @eric-haibin-lin in https://github.com/volcengine/verl/pull/2621
[ci] test: add CriticWorker unit test, make some util CPU friendly by @eric-haibin-lin in https://github.com/volcengine/verl/pull/2717
[ray] feat: RayWorkerGroup support set worker env by @NKcqx in https://github.com/volcengine/verl/pull/2685
[sglang] fix: Adding strict naming sanity for sglang by @zhaochenyang20 in https://github.com/volcengine/verl/pull/2719
[misc] chore: bump main branch version to v0.5.0.dev by @eric-haibin-lin in https://github.com/volcengine/verl/pull/2718
[megatron] fix: resolve backward propagation error in megatron_actor due to shared logits tensor in-place modification by @HelloWorld686 in https://github.com/volcengine/verl/pull/2484
[tool] fix: geo3k create return by @nanjiangwill in https://github.com/volcengine/verl/pull/2714
[doc] feat: Add agent-lightning in the list of "awesome works using verl by @wizardlancet in https://github.com/volcengine/verl/pull/2726
[ci] fix: checkpoint_convertor ci miss a hf model download by @ETOgaosion in https://github.com/volcengine/verl/pull/2730
[recipe] chore: add retool training script by @wuxibin89 in https://github.com/volcengine/verl/pull/2732
[ci] fix: release ascend test time, fix one step off-policy CI by @ETOgaosion in https://github.com/volcengine/verl/pull/2731
[doc] feat: add resizable sidebar and improve layout by @Tingberer in https://github.com/volcengine/verl/pull/2577
[docker] feat: upgrade to torch 2.7, sglang 0.4.8 by @ETOgaosion in https://github.com/volcengine/verl/pull/2617
[megatron] feat: a bunch of optimzation on vram, sequence packing by @ISEEKYAN in https://github.com/volcengine/verl/pull/2678
[CI] feat: add mypy to pre-commit by @frrad in https://github.com/volcengine/verl/pull/2614
[doc] style: change resize handle from gradient to plain color by @Tingberer in https://github.com/volcengine/verl/pull/2746
refactor: Make sure to keep the type checking by @YeonwooSung in https://github.com/volcengine/verl/pull/2634
[rollout] feat: remove chat scheduler by @wuxibin89 in https://github.com/volcengine/verl/pull/2725
[perf] feat: add optional role selection in discrete mode for NPU Profiler by @tongtong0613 in https://github.com/volcengine/verl/pull/2750
[doc] feat: add retool blog by @eric-haibin-lin in https://github.com/volcengine/verl/pull/2761
[algo] refactor: don't special-case compute_policy_loss by @frrad in https://github.com/volcengine/verl/pull/2701
[BREAKING] [rollout] chore: remove default rollout selection by @vermouth1992 in https://github.com/volcengine/verl/pull/2757

New Contributors

@NKcqx made their first contribution in https://github.com/volcengine/verl/pull/2685
@HelloWorld686 made their first contribution in https://github.com/volcengine/verl/pull/2484
@wizardlancet made their first contribution in https://github.com/volcengine/verl/pull/2726
@Tingberer made their first contribution in https://github.com/volcengine/verl/pull/2577
@MikeDean2367 made their first contribution in https://github.com/volcengine/verl/pull/2768
@kibitzing made their first contribution in https://github.com/volcengine/verl/pull/2777
@MzeroMiko made their first contribution in https://github.com/volcengine/verl/pull/2795
@clearhanhui made their first contribution in https://github.com/volcengine/verl/pull/2805
@panf2333 made their first contribution in https://github.com/volcengine/verl/pull/2849
@chi2liu made their first contribution in https://github.com/volcengine/verl/pull/2827
@wantbook-book made their first contribution in https://github.com/volcengine/verl/pull/2666
@Qiao0124 made their first contribution in https://github.com/volcengine/verl/pull/2476
@techkang made their first contribution in https://github.com/volcengine/verl/pull/2883
@looput made their first contribution in https://github.com/volcengine/verl/pull/2353
@EasonZhong668 made their first contribution in https://github.com/volcengine/verl/pull/2884
@TomQunChao made their first contribution in https://github.com/volcengine/verl/pull/2808
@ethen8181 made their first contribution in https://github.com/volcengine/verl/pull/2050
@wlf-darkmatter made their first contribution in https://github.com/volcengine/verl/pull/2602
@nariaki3551 made their first contribution in https://github.com/volcengine/verl/pull/2957
@xylcbd made their first contribution in https://github.com/volcengine/verl/pull/2430
@RasulAlakbarli made their first contribution in https://github.com/volcengine/verl/pull/2900
@zdhNarsil made their first contribution in https://github.com/volcengine/verl/pull/2881
@MrAta made their first contribution in https://github.com/volcengine/verl/pull/2868
@liqiongyu made their first contribution in https://github.com/volcengine/verl/pull/2985
@HaochenYuan made their first contribution in https://github.com/volcengine/verl/pull/3007
@Maxwell-Jia made their first contribution in https://github.com/volcengine/verl/pull/2398
@philippnormann made their first contribution in https://github.com/volcengine/verl/pull/3029
@JingchengYang4 made their first contribution in https://github.com/volcengine/verl/pull/3036
@codemayq made their first contribution in https://github.com/volcengine/verl/pull/3053

Full Changelog: https://github.com/volcengine/verl/compare/v0.5.0...v0.6.0

What's Changed

[cfg] refactor: add ActorConfig, EngineConfig, and ActorWorker unit test, refactor validation code by @eric-haibin-lin in https://github.com/volcengine/verl/pull/2621
[ci] test: add CriticWorker unit test, make some util CPU friendly by @eric-haibin-lin in https://github.com/volcengine/verl/pull/2717
[ray] feat: RayWorkerGroup support set worker env by @NKcqx in https://github.com/volcengine/verl/pull/2685
[sglang] fix: Adding strict naming sanity for sglang by @zhaochenyang20 in https://github.com/volcengine/verl/pull/2719
[misc] chore: bump main branch version to v0.5.0.dev by @eric-haibin-lin in https://github.com/volcengine/verl/pull/2718
[megatron] fix: resolve backward propagation error in megatron_actor due to shared logits tensor in-place modification by @HelloWorld686 in https://github.com/volcengine/verl/pull/2484
[tool] fix: geo3k create return by @nanjiangwill in https://github.com/volcengine/verl/pull/2714
[doc] feat: Add agent-lightning in the list of "awesome works using verl by @wizardlancet in https://github.com/volcengine/verl/pull/2726
[ci] fix: checkpoint_convertor ci miss a hf model download by @ETOgaosion in https://github.com/volcengine/verl/pull/2730
[recipe] chore: add retool training script by @wuxibin89 in https://github.com/volcengine/verl/pull/2732
[ci] fix: release ascend test time, fix one step off-policy CI by @ETOgaosion in https://github.com/volcengine/verl/pull/2731
[doc] feat: add resizable sidebar and improve layout by @Tingberer in https://github.com/volcengine/verl/pull/2577
[docker] feat: upgrade to torch 2.7, sglang 0.4.8 by @ETOgaosion in https://github.com/volcengine/verl/pull/2617
[megatron] feat: a bunch of optimzation on vram, sequence packing by @ISEEKYAN in https://github.com/volcengine/verl/pull/2678
[CI] feat: add mypy to pre-commit by @frrad in https://github.com/volcengine/verl/pull/2614
[doc] style: change resize handle from gradient to plain color by @Tingberer in https://github.com/volcengine/verl/pull/2746
refactor: Make sure to keep the type checking by @YeonwooSung in https://github.com/volcengine/verl/pull/2634
[rollout] feat: remove chat scheduler by @wuxibin89 in https://github.com/volcengine/verl/pull/2725
[perf] feat: add optional role selection in discrete mode for NPU Profiler by @tongtong0613 in https://github.com/volcengine/verl/pull/2750
[doc] feat: add retool blog by @eric-haibin-lin in https://github.com/volcengine/verl/pull/2761
[algo] refactor: don't special-case compute_policy_loss by @frrad in https://github.com/volcengine/verl/pull/2701
[BREAKING] [rollout] chore: remove default rollout selection by @vermouth1992 in https://github.com/volcengine/verl/pull/2757

New Contributors

@NKcqx made their first contribution in https://github.com/volcengine/verl/pull/2685
@HelloWorld686 made their first contribution in https://github.com/volcengine/verl/pull/2484
@wizardlancet made their first contribution in https://github.com/volcengine/verl/pull/2726
@Tingberer made their first contribution in https://github.com/volcengine/verl/pull/2577
@MikeDean2367 made their first contribution in https://github.com/volcengine/verl/pull/2768
@kibitzing made their first contribution in https://github.com/volcengine/verl/pull/2777
@MzeroMiko made their first contribution in https://github.com/volcengine/verl/pull/2795
@clearhanhui made their first contribution in https://github.com/volcengine/verl/pull/2805
@panf2333 made their first contribution in https://github.com/volcengine/verl/pull/2849
@chi2liu made their first contribution in https://github.com/volcengine/verl/pull/2827
@wantbook-book made their first contribution in https://github.com/volcengine/verl/pull/2666
@Qiao0124 made their first contribution in https://github.com/volcengine/verl/pull/2476
@techkang made their first contribution in https://github.com/volcengine/verl/pull/2883
@looput made their first contribution in https://github.com/volcengine/verl/pull/2353
@EasonZhong668 made their first contribution in https://github.com/volcengine/verl/pull/2884
@TomQunChao made their first contribution in https://github.com/volcengine/verl/pull/2808
@ethen8181 made their first contribution in https://github.com/volcengine/verl/pull/2050
@wlf-darkmatter made their first contribution in https://github.com/volcengine/verl/pull/2602
@nariaki3551 made their first contribution in https://github.com/volcengine/verl/pull/2957
@xylcbd made their first contribution in https://github.com/volcengine/verl/pull/2430
@RasulAlakbarli made their first contribution in https://github.com/volcengine/verl/pull/2900
@zdhNarsil made their first contribution in https://github.com/volcengine/verl/pull/2881
@MrAta made their first contribution in https://github.com/volcengine/verl/pull/2868
@liqiongyu made their first contribution in https://github.com/volcengine/verl/pull/2985
@HaochenYuan made their first contribution in https://github.com/volcengine/verl/pull/3007
@Maxwell-Jia made their first contribution in https://github.com/volcengine/verl/pull/2398
@philippnormann made their first contribution in https://github.com/volcengine/verl/pull/3029
@JingchengYang4 made their first contribution in https://github.com/volcengine/verl/pull/3036
@codemayq made their first contribution in https://github.com/volcengine/verl/pull/3053

Full Changelog: https://github.com/volcengine/verl/compare/v0.5.0...v0.6.0

verl

More Python Projects

AutoGPT

stable-diffusion-webui

transformers

yt-dlp

More Python Projects

AutoGPT

stable-diffusion-webui

transformers

yt-dlp

v0.6.0: model engine, rollout server, composability

Highlights

Model Engine

Rollout Server

Newly Supported Models

Algorithm

Recipe

Breaking changes and deprecations

nD Dispatch method

ShardingManager

Importance bug fixes

What's Changed

New Contributors

What's Changed

New Contributors