docs: update README.md by @eltociear in https://github.com/deepspeedai/DeepSpeed/pull/7781
Fix gradient checkpointing with use_reentrant=True / PyTorch-style backward / ZeRO-3 by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7780
Fix Ulysses PEFT test by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7784
Fix Evoformer compilation by @sdvillal in https://github.com/deepspeedai/DeepSpeed/pull/7760
fix checkpointing/loading of z0+bf16 by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7786
Add sequential allgather optimization for ZeRO-3 by @aeeeeeep in https://github.com/deepspeedai/DeepSpeed/pull/7661
Fix AutoTP test numerical tolerance with rtol by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7794
Fix backward for pipeline engine by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7787
Skip empty parameters in gradient reduction by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7789
Fix issue with BF16 optimizer selection by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7788
Fix BF16_Optimizer being used without ZeRO by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7790
Add full test suite workflow by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7795
Fix Muon optimizer module path by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7802
Fix ping-pong buffer index reset and removing redundant stream sync by @undersilence in https://github.com/deepspeedai/DeepSpeed/pull/7805
Fix ZeRO stage to choose BF16 optimizer in test by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7803
Run Evoformer tests sequentially by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7810
Improve engine's cleanup by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7813
Ignore evoformer test by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7815
Fix typos in accelerator setup guide by @nathon-lee in https://github.com/deepspeedai/DeepSpeed/pull/7818
Raise clear error on in-place GatheredParameters edits without modifier_rank by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7817
[Bugfix] Resolve Rank index out of range during BWD when sp_size < world_size in Ulysses by @Flink-ddd in https://github.com/deepspeedai/DeepSpeed/pull/7809
Update PyTorch to v2.9 for modal tests by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7816
New Contributors
@ooooo-create made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7779
@T1mn made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7769
@sdvillal made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7760
@undersilence made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7805
Full Changelog: https://github.com/deepspeedai/DeepSpeed/compare/v0.18.4...v0.18.5