December Release + 3x Faster Training

Thanks for all the love and support this year! We're wishing you all a lovely Christmas. Please update Unsloth & our Docker to use the latest updates! 🦥
Unsloth December Release

Introducing 3x faster training & 30% less VRAM. New Triton kernels, padding-free & packing. Blog
500K Context training and reinforcement learning is now possible on a single 80GB GPU. Blog • Notebook
Fine-tune then Deploy LLMs on your Phone with PyTorch and Unsloth. Tweet • Read Guide
🤗 Transformers v5 is now supported! It's not enabled by default due to possible instability issues.
Preliminary multi-GPU support: DDP Guide (not representative of the official release early next year)
More: Sudoku RL nb • Paddle-OCR nb • New NVIDIA blog
Lots of bug fixes! See further below.

:crystal_ball: New Models + Guides

:sparkles:FunctionGemma: Google new 270M tool-calling LLM. Guide • Notebook
Nemotron 3: NVIDIA new 30B reasoning model. Guide • GGUF
Mistral: new coding & instruct VLMs. Ministral 3 • Devstral 2
GLM-4.6V: new vision models. Guide • 4.6V • 4.6V-Flash
More: Qwen3-Next • Mistral Large 3 • FLUX.2-dev

[!TIP] Update Unsloth via pip install --upgrade --force-reinstall --no-cache-dir --no-deps unsloth unsloth_zoo If you want PyTorch 2.9: pip install --upgrade unsloth unsloth_zoo

Bug Fixes and Enhancements

Supports rollout_func allowing multi turn RL to work
Supports vllm>=0.12.0 and efficient GRPO for it
Supports transformers>=5.0.0, first shown via our Ministral notebooks
Fix HuggingFace token logins not working for private repos
Fixes TorchAO and QAT not working during saving
Fixed DeepSeek OCR finetuning not loading finetuned models
Improved vision utilities for vision VLM finetuning

What's Changed

Fix llama tokenizer padding_side when using model.generate in inference mode by @dmsuehir in https://github.com/unslothai/unsloth/pull/3644
Fix indefinite article usage in comments and docstrings by @mk0walsk in https://github.com/unslothai/unsloth/pull/3648
fix rope_theta -> rope_parameters['rope_theta'] by @mmathew23 in https://github.com/unslothai/unsloth/pull/3651
Fix broken link for advanced pip installation in README by @gitpullpull in https://github.com/unslothai/unsloth/pull/3652
Fix: prevent load_in_fp8 kwarg from reaching Qwen3MoeForCausalLM constructor (Fix #3649) by @bhuvanprakash in https://github.com/unslothai/unsloth/pull/3654
make unsloth_tiled_mlp a from_pretrained arg by @mmathew23 in https://github.com/unslothai/unsloth/pull/3655
FIX set defualt [128, 128] insted of none by @ved1beta in https://github.com/unslothai/unsloth/pull/3658
Fix: Pass gradient_checkpointing parameter to model.for_training() by @sbhavani in https://github.com/unslothai/unsloth/pull/3659
[FIX] Vllm guided decoding params by @Datta0 in https://github.com/unslothai/unsloth/pull/3662
Vllm guided decoding by @Datta0 in https://github.com/unslothai/unsloth/pull/3663
Nightly by @danielhanchen in https://github.com/unslothai/unsloth/pull/3664
[pre-commit.ci] pre-commit autoupdate by @pre-commit-ci[bot] in https://github.com/unslothai/unsloth/pull/3666
Update transformers version constraint in pyproject.toml by @noah1510 in https://github.com/unslothai/unsloth/pull/3689
[pre-commit.ci] pre-commit autoupdate by @pre-commit-ci[bot] in https://github.com/unslothai/unsloth/pull/3694
Remove reload_weights rpc call from grpo trainer by @Datta0 in https://github.com/unslothai/unsloth/pull/3673
[Fix] [TRL] load_lora for multi line llm.chat/generate by @Datta0 in https://github.com/unslothai/unsloth/pull/3696
Nightly by @danielhanchen in https://github.com/unslothai/unsloth/pull/3698
SFT sample packing by @djsaunde in https://github.com/unslothai/unsloth/pull/3566
Auto-enable padding-free SFT by @djsaunde in https://github.com/unslothai/unsloth/pull/3672
[FIX] fbgemm version check by @Datta0 in https://github.com/unslothai/unsloth/pull/3704
Nightly by @danielhanchen in https://github.com/unslothai/unsloth/pull/3706
update TRL filter by @djsaunde in https://github.com/unslothai/unsloth/pull/3707
[intel] skip xpu fbgemm fp8 by @leizhenyuan in https://github.com/unslothai/unsloth/pull/3625
Mistral packing, train on completions only, simplifications by @djsaunde in https://github.com/unslothai/unsloth/pull/3709
Update torchao save by @metascroy in https://github.com/unslothai/unsloth/pull/3679
Nightly by @danielhanchen in https://github.com/unslothai/unsloth/pull/3720
[pre-commit.ci] pre-commit autoupdate by @pre-commit-ci[bot] in https://github.com/unslothai/unsloth/pull/3731
Bug fixes by @danielhanchen in https://github.com/unslothai/unsloth/pull/3734
Update FUNDING.yml by @danielhanchen in https://github.com/unslothai/unsloth/pull/3736
Nightly by @danielhanchen in https://github.com/unslothai/unsloth/pull/3737
Fix Deepseek OCR Lora Model Load by @mmathew23 in https://github.com/unslothai/unsloth/pull/3738

Unsloth Zoo Changes

updates for vLLM compativility with lora by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/359
Nightly by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/355
Add logging to tiled mlp and fix target chunk size calculation by @mmathew23 in https://github.com/unslothai/unsloth-zoo/pull/361
Remove include_buffers from init_empty_weights by @pluesclues in https://github.com/unslothai/unsloth-zoo/pull/363
packed seq lengths token count correction by @djsaunde in https://github.com/unslothai/unsloth-zoo/pull/348
Configure ce target gb by @mmathew23 in https://github.com/unslothai/unsloth-zoo/pull/365
[FIX] vLLM LoRA extra vocab by @Datta0 in https://github.com/unslothai/unsloth-zoo/pull/367
Nightly by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/368
[FIX] vLLM local lora tensor loading by @Datta0 in https://github.com/unslothai/unsloth-zoo/pull/370
vllm lora_dir rename and make embedding padding optional by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/373
Bug fixes by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/375
Update e to error by @ChetanKrishna07 in https://github.com/unslothai/unsloth-zoo/pull/374
Vision utils decode image improvement by @mmathew23 in https://github.com/unslothai/unsloth-zoo/pull/372
[FIX] [DDP] Fix compile for distributed training by @Datta0 in https://github.com/unslothai/unsloth-zoo/pull/379
Nightly by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/382
update compiler for XLMRobertaModel by @electroglyph in https://github.com/unslothai/unsloth-zoo/pull/383
Fix Deepseek OCR Lora Model Load by @mmathew23 in https://github.com/unslothai/unsloth-zoo/pull/386
fix for non-generation models in transformers 5 by @electroglyph in https://github.com/unslothai/unsloth-zoo/pull/388

New Contributors

@dmsuehir made their first contribution in https://github.com/unslothai/unsloth/pull/3644
@gitpullpull made their first contribution in https://github.com/unslothai/unsloth/pull/3652
@bhuvanprakash made their first contribution in https://github.com/unslothai/unsloth/pull/3654
@ved1beta made their first contribution in https://github.com/unslothai/unsloth/pull/3658
@sbhavani made their first contribution in https://github.com/unslothai/unsloth/pull/3659
@noah1510 made their first contribution in https://github.com/unslothai/unsloth/pull/3689
@ChetanKrishna07 made their first contribution in https://github.com/unslothai/unsloth-zoo/pull/374
@electroglyph made their first contribution in https://github.com/unslothai/unsloth-zoo/pull/383

Full Changelog: https://github.com/unslothai/unsloth/compare/November-2025...December-2025

Bug Fixes and Enhancements

Supports rollout_func allowing multi turn RL to work

Supports vllm>=0.12.0 and efficient GRPO for it

Supports transformers>=5.0.0, first shown via our Ministral notebooks

Fix HuggingFace token logins not working for private repos

Fixes TorchAO and QAT not working during saving

Fixed DeepSeek OCR finetuning not loading finetuned models

Improved vision utilities for vision VLM finetuning

What's Changed

Fix llama tokenizer padding_side when using model.generate in inference mode by @dmsuehir in https://github.com/unslothai/unsloth/pull/3644

Fix indefinite article usage in comments and docstrings by @mk0walsk in https://github.com/unslothai/unsloth/pull/3648

fix rope_theta -> rope_parameters['rope_theta'] by @mmathew23 in https://github.com/unslothai/unsloth/pull/3651

Fix broken link for advanced pip installation in README by @gitpullpull in https://github.com/unslothai/unsloth/pull/3652

Fix: prevent load_in_fp8 kwarg from reaching Qwen3MoeForCausalLM constructor (Fix #3649) by @bhuvanprakash in https://github.com/unslothai/unsloth/pull/3654

make unsloth_tiled_mlp a from_pretrained arg by @mmathew23 in https://github.com/unslothai/unsloth/pull/3655

FIX set defualt [128, 128] insted of none by @ved1beta in https://github.com/unslothai/unsloth/pull/3658

Fix: Pass gradient_checkpointing parameter to model.for_training() by @sbhavani in https://github.com/unslothai/unsloth/pull/3659

[FIX] Vllm guided decoding params by @Datta0 in https://github.com/unslothai/unsloth/pull/3662

Vllm guided decoding by @Datta0 in https://github.com/unslothai/unsloth/pull/3663

Nightly by @danielhanchen in https://github.com/unslothai/unsloth/pull/3664

[pre-commit.ci] pre-commit autoupdate by @pre-commit-ci[bot] in https://github.com/unslothai/unsloth/pull/3666

Update transformers version constraint in pyproject.toml by @noah1510 in https://github.com/unslothai/unsloth/pull/3689

[pre-commit.ci] pre-commit autoupdate by @pre-commit-ci[bot] in https://github.com/unslothai/unsloth/pull/3694

Remove reload_weights rpc call from grpo trainer by @Datta0 in https://github.com/unslothai/unsloth/pull/3673

[Fix] [TRL] load_lora for multi line llm.chat/generate by @Datta0 in https://github.com/unslothai/unsloth/pull/3696

Nightly by @danielhanchen in https://github.com/unslothai/unsloth/pull/3698

SFT sample packing by @djsaunde in https://github.com/unslothai/unsloth/pull/3566

Auto-enable padding-free SFT by @djsaunde in https://github.com/unslothai/unsloth/pull/3672

[FIX] fbgemm version check by @Datta0 in https://github.com/unslothai/unsloth/pull/3704

Nightly by @danielhanchen in https://github.com/unslothai/unsloth/pull/3706

update TRL filter by @djsaunde in https://github.com/unslothai/unsloth/pull/3707

[intel] skip xpu fbgemm fp8 by @leizhenyuan in https://github.com/unslothai/unsloth/pull/3625

Mistral packing, train on completions only, simplifications by @djsaunde in https://github.com/unslothai/unsloth/pull/3709

Update torchao save by @metascroy in https://github.com/unslothai/unsloth/pull/3679

Nightly by @danielhanchen in https://github.com/unslothai/unsloth/pull/3720

[pre-commit.ci] pre-commit autoupdate by @pre-commit-ci[bot] in https://github.com/unslothai/unsloth/pull/3731

Bug fixes by @danielhanchen in https://github.com/unslothai/unsloth/pull/3734

Update FUNDING.yml by @danielhanchen in https://github.com/unslothai/unsloth/pull/3736

Nightly by @danielhanchen in https://github.com/unslothai/unsloth/pull/3737

Fix Deepseek OCR Lora Model Load by @mmathew23 in https://github.com/unslothai/unsloth/pull/3738

Unsloth Zoo Changes

updates for vLLM compativility with lora by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/359

Nightly by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/355

Add logging to tiled mlp and fix target chunk size calculation by @mmathew23 in https://github.com/unslothai/unsloth-zoo/pull/361

Remove include_buffers from init_empty_weights by @pluesclues in https://github.com/unslothai/unsloth-zoo/pull/363

packed seq lengths token count correction by @djsaunde in https://github.com/unslothai/unsloth-zoo/pull/348

Configure ce target gb by @mmathew23 in https://github.com/unslothai/unsloth-zoo/pull/365

[FIX] vLLM LoRA extra vocab by @Datta0 in https://github.com/unslothai/unsloth-zoo/pull/367

Nightly by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/368

[FIX] vLLM local lora tensor loading by @Datta0 in https://github.com/unslothai/unsloth-zoo/pull/370

vllm lora_dir rename and make embedding padding optional by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/373

Bug fixes by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/375

Update e to error by @ChetanKrishna07 in https://github.com/unslothai/unsloth-zoo/pull/374

Vision utils decode image improvement by @mmathew23 in https://github.com/unslothai/unsloth-zoo/pull/372

[FIX] [DDP] Fix compile for distributed training by @Datta0 in https://github.com/unslothai/unsloth-zoo/pull/379

Nightly by @danielhanchen in https://github.com/unslothai/unsloth-zoo/pull/382

update compiler for XLMRobertaModel by @electroglyph in https://github.com/unslothai/unsloth-zoo/pull/383

Fix Deepseek OCR Lora Model Load by @mmathew23 in https://github.com/unslothai/unsloth-zoo/pull/386

fix for non-generation models in transformers 5 by @electroglyph in https://github.com/unslothai/unsloth-zoo/pull/388

New Contributors

@dmsuehir made their first contribution in https://github.com/unslothai/unsloth/pull/3644

@gitpullpull made their first contribution in https://github.com/unslothai/unsloth/pull/3652

@bhuvanprakash made their first contribution in https://github.com/unslothai/unsloth/pull/3654

@ved1beta made their first contribution in https://github.com/unslothai/unsloth/pull/3658

@sbhavani made their first contribution in https://github.com/unslothai/unsloth/pull/3659

@noah1510 made their first contribution in https://github.com/unslothai/unsloth/pull/3689

@ChetanKrishna07 made their first contribution in https://github.com/unslothai/unsloth-zoo/pull/374

@electroglyph made their first contribution in https://github.com/unslothai/unsloth-zoo/pull/383

Full Changelog: https://github.com/unslothai/unsloth/compare/November-2025...December-2025

unsloth

:crystal_ball: New Models + Guides

Bug Fixes and Enhancements

What's Changed

Unsloth Zoo Changes

New Contributors

More Python Projects

AutoGPT

stable-diffusion-webui

transformers

yt-dlp

December Release + 3x Faster Training

:crystal_ball: New Models + Guides

Bug Fixes and Enhancements

What's Changed

Unsloth Zoo Changes

New Contributors

More Python Projects

AutoGPT

stable-diffusion-webui

transformers

yt-dlp