Unclaimed project
Are you a maintainer of onnxruntime ? Claim this project to take control of your public changelog and roadmap.
Claim this project Changelog
onnxruntime ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
ai-framework deep-learning hardware-acceleration machine-learning neural-networks onnx +3
Last updated about 4 hours ago
ยฉ 2026 AnnounceHQ. All rights reserved.
Back to changelogNew February 6, 2026
ONNX Runtime v1.24.1 ๐ข Announcements & Breaking Changes
Platform Support Changes
Python 3.10 wheels are no longer published โ Please upgrade to Python 3.11+
Python 3.14 support added
Free-threaded Python (PEP 703) โ Added support for Python 3.13t and 3.14t in Linux (#26786 )
x86_64 binaries for macOS/iOS are no longer provided and minimum macOS is raised to 14.0
API Version
ORT_API_VERSION updated to 24 (#26418 )
โจ New Features
๐ค Execution Provider (EP) Plugin API A major infrastructure enhancement enabling plugin-based EPs with dynamic loading:
Initial kernel-based EP support (#26206 )
Weight pre-packing support for plugin EPs (#26754 )
EP Context model support (#25124 )
Control flow kernel APIs (#26927 )
OrtKernelInfo APIs for kernel-based plugin EPs (#26803 )
๐ง Core APIs
OrtApi::CreateEnvWithOptions() and OrtEpApi::GetEnvConfigEntries() (#26971 )
EP Device Compatibility APIs (#26922 )
External Resource Importer API for D3D12 shared resources (#26828 )
Session config access from KernelInfo (#26589 )
๐ Dependencies & Integration
ONNX upgraded to 1.20.1 (#26579 )
Protobuf updated from 3.20.3 โ 4.25.8 (#26910 )
CUDA Graph enabled by default (#26929 )
๐ฅ๏ธ Execution Provider Updates
NVIDIA
CUDA EP: Flash Attention updates, GQA kernel fusion, BF16 support for MoE/qMoE/MatMulNBits, CUDA 13.0 support
TensorRT EP: Upgraded to TensorRT 10.14, automatic plugin loading, NVFP4 custom ops
TensorRT RTX EP: RTX runtime caching, CUDA graph support, BFloat16, memory-mapped engines
Qualcomm QNN EP
QNN SDK upgraded to 2.42.0 with new ops (RMSNorm, ScatterElements, GatherND, STFT, RandomUniformLike)
Gelu pattern fusion, LPBQ quantization support, ARM64 wheel builds, v81 device support
Intel & AMD
OpenVINO EP: Upgraded to 2025.4.1
VitisAI EP: External EP loader, compiled model compatibility API
MIGraphX EP: QuickGelu, multihead attention, QLinear pooling ops
ArmNN EP Arm is formally deprecating the Arm NN Execution Provider (EP) in ONNX Runtime. The Arm NN EP is still experimental and depends on technology that is no longer actively maintained. Keeping it available now only adds complexity and potential confusion for users.
Effective immediately, the Arm NN EP is deprecated and will no longer be maintained
All build options, documentation, and examples referencing ArmNN will be removed once the upstream change merges; the removal will appear in the first ONNX Runtime release that includes that change. We will confirm the release number as soon as it is known
Builds that still rely on Arm NN-specific options (for example --use_armnn) will fail after the change lands, so please adjust configurations in advance
๐ Web & JavaScript
WebGPU EP: Flash Attention optimizations, graph capture, Split-K MatMul, qMoE support, WGSL templates
WebNN EP: GQA local attention, GatherBlockQuantized, ConvInteger/MatMulInteger
Node.js/React Native: Node.js v22, JSI for React Native, JSPI build support
๐ง CPU Improvements
KleidiAI: SME1/SME2 Convolution and SGemm kernels, FP32 Gemv, Windows/Arm support
New ops: MoE/qMoE kernels, RotaryEmbeddings opset 23, LayerNorm/RMSNorm broadcasting
Platform support: S390x SIMD, LoongArch64 4-bit quantization, FP16 inference improvements
ARM NCHWc layout support: NCHWc layout support for potential performance improvement of Conv models. Needs building from source with --enable_arm_neon_nchwc to enable this feature (#25580 #26838 #26691 #26171 ). This feature may be turned ON by default in a future release based on community feedback.
ARM perf improvements: Dedicated depthwise conv kernel (#26688 ) and SiLU activation perf improvement (#26753 )
๐ Language Bindings
C#
.NET 9.0 MAUI targets (#26463 )
Python
add_external_initializers_from_files (#26012 )
Java
Auto EP and compile model support (#25131 )
OrtCompiledModelCompatibility (#26028 )
๐ Bug Fixes
Critical Fixes
DoS vulnerability in FuseReluClip (#26878 )
Security issue loading arbitrary files as external data (#26776 )
Memory leak fix for KernelContext_GetAllocator (#26883 )
Local Attention off-by-1 bug (#25927 )
EP-Specific Fixes
[QNN] Clip op with min/max from QDQ (#26601 )
[CoreML] Gather fp16 support (#26442 )
๐ Contributors Thanks to our 170 contributors for this release!
@fs-eire , @tianleiwu , @edgchen1 , @qjia7 , @yuslepukhin , @hariharans29 , @Honry , @qti-yuduo , @adrianlizarraga , @snnn , @eserscor , @vraspar , @xiaofeihan1 , @guschmue , @daijh , @quic-muchhsu , @qti-jkilpatrick , @tirupath-qti , @Jiawei-Shao , @qti-hungjuiw , @quic-ashwshan , @titaiwangms , @qti-mattsinc , @chilo-ms , @jchen10 , @xhcao , @skottmckay , @quic-calvnguy , @JonathanC-ARM , @Rohanjames1997 , @sushraja-msft , @jambayk , @adrastogi , @xenova , @quic-tirupath , @justinchuby , @HectorSVC , @kunal-vaishnavi , @wenqinI , @prathikr , @baijumeswani , @preetha-intel , @jatinwadhwa921 , @umangb-09 , @qti-ashwshan , @carzh , @bachelor-dou , @ranjitshs , @gedoensmax , @xadupre , @nenad1002 , @TedThemistokleous , @keshavv27 , @zpye , @jnagi-intel , @jiafatom , @mingyueliuh , @Colm-in-Arm , @borg323 , @chunghow-qti , @Craigacp , @BODAPATIMAHESH , @AlekseiNikiforovIBM , @hans00 , @thevishalagarwal , @MaanavD , @qti-kromero , @damdoo01-arm , @BoarQing , @naomiOvad , @yuhuchua-qti , @hadiFute , @vishalpandya1990 , @rivkastroh , @minfhong-qti , @kuanyul-qti , @xieofxie , @ankitm3k , @RyanMetcalfeInt8 , @MayureshV1 , @bopeng1234 , @vthaniel , @mdvoretc-intel , @ericcraw , @javier-intel , @saurabhkale17 , @sfatimar , @Kotomi-Du , @intbf , @n1harika , @TejalKhade28 , @gupta-pallavi , @cbourjau , @nieubank , @r-devulap , @wszqkzqk , @sanketkaleoss , @amancini-N , @fanchenkong1 , @meakbiyik , @hisham-hchowdhu , @shaoboyan091 , @Stonesjtu , @qwu16 , @wangw-1991 , @bonktree , @naetherm , @nikhilfujitsu , @Panxuefeng-loongson , @selenayang888 , @moyo1997 , @chwarr , @patryk-kaiser-ARM , @fdwr , @SavaLione , @shiyi9801 , @mcost45 , @aciddelgado , @prudhvi-qti , @Jonahcb , @lifang-zhang , @zhaoxul-qti , @gaugarg-nv , @cocotdf , @WangFengtu1996 , @orlmon01 , @weidu-tpvision , @theHamsta , @kevinch-nv , @XXXXRT666 , @movedancer , @melkap01-Arm , @KingSora , @urpetkov-amd , @junchao-loongson , @jixiongdeng , @wcy123 , @GrigoryEvko , @anujj , @peishenyan , @quic-ankus , @jchen351 , @yihonglyu , @satyajandhyala , @co63oc , @mschofie , @quic-ashigarg , @asoldano , @nproshun , @jiangzhaoming , @seungtaek94 , @liqunfu , @jaholme , @hanbitmyths , @quic-boyuc , @rM-planet , @qti-vaiskv , @AndreyOrb , @pkubaj , @xhan65 , @Jaswanth51 , @quic-hungjuiw , @jywu-msft , @mklimenk , @derdeljan-msft , @ianfhunter , @NingW101 , @feich-ms , @Akupadhye , @wschin