Jan 28, 2026

Highlights

New CLI: mistralrs-cli
Prefix Caching: We have implemented Prefix Caching for PagedAttention (#1750). This significantly accelerates multi-turn conversations and RAG workflows by reusing KV cache for shared prompt prefixes.
Major model expanstion: Support for the Embedding Gemma, Qwen 3 Embedding, Gemma 3n, GLM-4, Granite Hybrid MoE, GLM-4 MoE, GLM-4 MoE Lite...

Read full release & details

Jun 10, 2025

Dockerfiles (CUDA, CPU): https://github.com/EricLBuehler/mistral.rs/pkgs/container/mistral.rs
PyPi packages (no features, cuda, mkl, metal, accelerate)

🔥 Highlights from v...

Read full release & details

Mar 24, 2025

Highlights

Blog post: https://huggingface.co/blog/EricB/mistralrs-v0-5-0

Thank you to all contributors for this release! This release includes the following highlights but also countless improvements, fixes, and optimizations.

Support for many more models:
- Gemma 3
- Qwen 2.5 VL
- Mistral Small 3.1
- Phi 4 Multimodal (image only)
Native tool calling support fo...

Read full release & details

Jan 22, 2025

New features

🔥 New models!
- DeepSeek V2
- DeepSeek V3 and R1
- MiniCpm-O 2.6
🧮 Imatrix quantization
⚙️ Automatic device mapping
BNB quantization
Support blockwise FP8 dequantization and FP8 on Metal
Integrate the llguidance library (@mmoskal)
Metal PagedAttention
Many fixes and improvements from contributors!

Breaking changes

The Rust device mapping...

Read full release & details

Nov 28, 2024

New features

Qwen2-VL support
Idefics 3/SmolVLM support
️‍🔥 6x prompt performance boost (all benchmarks faster than or comparable to MLX, llama.cpp)!
🗂️ More efficient non-PagedAttention KV cache implementation!
Public tokenization API

Python wheels

The wheels now include support for Windows, Linux, and Mac with x84_64 and aarch64.

MSRV

1.79.0

What's Changed...

Read full release & details

mistral.rs

Related Projects

mapbox-navigation-android

ToastFish

barcodelib

JPProject.IdentityServer4.SSO

v0.7.0

Highlights

v0.6.0

🔥 Highlights from v...

v0.5.0

Highlights

v0.4.0

New features

Breaking changes

v0.3.4

New features

Python wheels

MSRV

What's Changed...

Related Projects

mapbox-navigation-android

ToastFish

barcodelib

JPProject.IdentityServer4.SSO