mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
MSCCL++ v0.8.0
What's Changed
- Fix #458 by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/568
- Fix multinode test failure by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/574
- Separate linters from cmake by @chhwang in https://github.com/microsoft/mscclpp/pull/587
- Fix relaxedWait() by @chhwang in https://github.com/microsoft/mscclpp/pull/594
- NCCL fixes by @chhwang in https://github.com/microsoft/mscclpp/pull/592
- Updated Dev Container by @chhwang in https://github.com/microsoft/mscclpp/pull/591
- Support CudaIpc connection within a single process by @chhwang in https://github.com/microsoft/mscclpp/pull/593
- Fix GpuStreamPool to be aware of the device ID of streams by @chhwang in https://github.com/microsoft/mscclpp/pull/590
- update pytest and python API to fix ut failure by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/598
- Fixed the local channel test by @chhwang in https://github.com/microsoft/mscclpp/pull/597
- Use smart pointer for IB structure by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/585
- Update documentation by @chhwang in https://github.com/microsoft/mscclpp/pull/576
- Support CUDA 12.9 by @chhwang in https://github.com/microsoft/mscclpp/pull/600
- Merge ChannelTrigger with ProxyTrigger by @chhwang in https://github.com/microsoft/mscclpp/pull/601
- MNNVL fix by @chhwang in https://github.com/microsoft/mscclpp/pull/604
- New DSL implementation by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/579
- python doc auto generation by @chhwang in https://github.com/microsoft/mscclpp/pull/605
- all2all implementation by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/609
- Fix ut by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/613
- Create ib mr for per ib transport by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/611
- Fix for multi-nodes test by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/614
- add torch test by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/612
- AlltoAll Test Support by @caiomcbr in https://github.com/microsoft/mscclpp/pull/606
- Adding Channel Id Field DSL Port Channel Operations by @caiomcbr in https://github.com/microsoft/mscclpp/pull/615
- Fix deadlock in Executor channel setup by @caiomcbr in https://github.com/microsoft/mscclpp/pull/616
- Fix NVLS correctness issue by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/618
- Fixed cpp linter by @chhwang in https://github.com/microsoft/mscclpp/pull/619
- Thread Block Group DSL by @caiomcbr in https://github.com/microsoft/mscclpp/pull/621
- Fix memory exchange within a single process by @chhwang in https://github.com/microsoft/mscclpp/pull/624
- Fix hang issue in logging submodule by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/625
- Integrate MSCCL++ with torch workload by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/626
- Add FifoDeviceHandle::poll() by @chhwang in https://github.com/microsoft/mscclpp/pull/630
- Fix Illegal Memory Access in nvls_test for CUDA12.9 by @abhijangda in https://github.com/microsoft/mscclpp/pull/631
- Adapt with torch 2.6 by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/632
- Fix for safe process teardown by @chhwang in https://github.com/microsoft/mscclpp/pull/633
- use unix socket to share fd by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/634
- Address teardown issue by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/638
- Revise NCCL API implementation by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/617
- Support detailed version tracking that captures git repository information by @seagater in https://github.com/microsoft/mscclpp/pull/639
- Fix Rocm build issue by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/642
- Add 2 Node AllReduce DSL Algorithm by @caiomcbr in https://github.com/microsoft/mscclpp/pull/636
- Make ncclReduce/ncclSend/ncclRecv work by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/643
- Reduce memory footprint for allreduce8 and allgather6 by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/644
- Add MSCCLPP_GIT_COMMIT micro by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/640
- Address corner case when generating version file by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/641
- Pipeline fix by @Binyang2014 in https://github.com/microsoft/mscclpp/pull/645
New Contributors
- @abhijangda made their first contribution in https://github.com/microsoft/mscclpp/pull/631
Full Changelog: https://github.com/microsoft/mscclpp/compare/v0.7.0...v0.8.0