Release v1.0.24
pytorch-image-models v1.0.24
Breaking Changes
- Removed APEX AMP support
Features
- New models: CSATv2 (lightweight, high-res with DCT stem & spatial attention) + ~21M param variant at 512x512 & 640x640
- New optimizers: AdaMuon and NAdaMuon support added to existing Muon implementation
- Differential Attention: New
DiffAttentionmodule +DiffParallelScalingBlockfor ViT; trained small ViT variants (dwee,dpwee) - Pooling modules: Added
LsePlusandSimPool - Task abstraction: Lightweight task API with logits and feature distillation support in train script
init_non_persistent_buffers()method for externally calling buffer init after meta-device initialization
Fixes
- Fixed legacy
timm.models.layersimport path that broke deprecated imports (v1.0.24 patch) - Fixed autocasting in fast normalization functions to handle optional weight params safely
- Optimized
DropBlock2d+ added support to ByobNet-based models
Improvements
- Test coverage bumped to PyTorch 2.9.1 + Python 3.13 (upper), PyTorch 1.13 + Python 3.10 (lower)
- Added benchmark CSVs for inference timing (RTX Pro 6000, 5090, 4090 with PyTorch 2.9.1)
- New pre-trained weights: NAdaMuon-trained differential ViTs with improved accuracy
- Type hints added to
CosineLRSchedulerand other scheduler classes - Docstrings added to layer helper functions and modules
Jan 5 & 6, 2025
- Patch Release 1.0.24 (fix for 1.0.23)
- Add new benchmark result csv files for inference timing on all models w/ RTX Pro 6000, 5090, and 4090 cards w/ PyTorch 2.9.1
- Fix moved module error in deprecated timm.models.layers import path that impacts legacy imports
- Release 1.0.23
Dec 30, 2025
- Add better NAdaMuon trained
dpwee,dwee,dlittle(differential) ViTs with a small boost over previous runs- https://huggingface.co/timm/vit_dlittle_patch16_reg1_gap_256.sbb_nadamuon_in1k (83.24% top-1)
- https://huggingface.co/timm/vit_dwee_patch16_reg1_gap_256.sbb_nadamuon_in1k (81.80% top-1)
- https://huggingface.co/timm/vit_dpwee_patch16_reg1_gap_256.sbb_nadamuon_in1k (81.67% top-1)
- Add a ~21M param
timmvariant of the CSATv2 model at 512x512 & 640x640- https://huggingface.co/timm/csatv2_21m.sw_r640_in1k (83.13% top-1)
- https://huggingface.co/timm/csatv2_21m.sw_r512_in1k (82.58% top-1)
- Factor non-persistent param init out of
__init__into a common method that can be externally called viainit_non_persistent_buffers()after meta-device init.
Dec 12, 2025
- Add CSATV2 model (thanks https://github.com/gusdlf93) -- a lightweight but high res model with DCT stem & spatial attention. https://huggingface.co/Hyunil/CSATv2
- Add AdaMuon and NAdaMuon optimizer support to existing
timmMuon impl. Appears more competitive vs AdamW with familiar hparams for image tasks. - End of year PR cleanup, merge aspects of several long open PR
- Merge differential attention (
DiffAttention), add correspondingDiffParallelScalingBlock(for ViT), train some wee vits- https://huggingface.co/timm/vit_dwee_patch16_reg1_gap_256.sbb_in1k
- https://huggingface.co/timm/vit_dpwee_patch16_reg1_gap_256.sbb_in1k
- Add a few pooling modules,
LsePlusandSimPool - Cleanup, optimize
DropBlock2d(also add support to ByobNet based models)
- Merge differential attention (
- Bump unit tests to PyTorch 2.9.1 + Python 3.13 on upper end, lower still PyTorch 1.13 + Python 3.10
Dec 1, 2025
- Add lightweight task abstraction, add logits and feature distillation support to train script via new tasks.
- Remove old APEX AMP support
What's Changed
- Add val-interval argument by @t0278611 in https://github.com/huggingface/pytorch-image-models/pull/2606
- Add coord attn and some variants that I had lying around by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2617
- Distill fixups by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2598
- A simplification and some fixes for DropBlock2d. by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2620
- Other pooling... by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2621
- Experimenting with differential attention by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2314
- Differential + parallel attn by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2625
- AdaMuon impl w/ a few other ideas based on recent reading by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2626
- Csatv2 contribution by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2627
- Add HParams sections to hfdocs by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2630
- Upgrade GitHub Actions for Node 24 compatibility by @salmanmkc in https://github.com/huggingface/pytorch-image-models/pull/2633
- [BUG] Modify autocasting in fast normalization functions to handle optional weight params safely by @tesfaldet in https://github.com/huggingface/pytorch-image-models/pull/2631
- 'init_non_persistent_buffers' scheme by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2632
- Add docstrings to layer helper functions and modules by @raimbekovm in https://github.com/huggingface/pytorch-image-models/pull/2634
- refactor(scheduler): add type hints to CosineLRScheduler by @haru-256 in https://github.com/huggingface/pytorch-image-models/pull/2640
- A few misc weights to close out 2025 by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2639
- Update typing in other scheduler classes. Add unit tests. by @rwightman in https://github.com/huggingface/pytorch-image-models/pull/2641
New Contributors
- @t0278611 made their first contribution in https://github.com/huggingface/pytorch-image-models/pull/2606
- @salmanmkc made their first contribution in https://github.com/huggingface/pytorch-image-models/pull/2633
- @tesfaldet made their first contribution in https://github.com/huggingface/pytorch-image-models/pull/2631
- @raimbekovm made their first contribution in https://github.com/huggingface/pytorch-image-models/pull/2634
- @haru-256 made their first contribution in https://github.com/huggingface/pytorch-image-models/pull/2640
Full Changelog: https://github.com/huggingface/pytorch-image-models/compare/v1.0.22...v1.0.24