v1.25

🚀 New Features

Model Support

• Added support for Qwen3VL (now using static ViT)

• Enabled Janus model compatibility

• Improved MiniCPM4 model handling

• Enhanced VLM dynamic compilation for ViT models

• Added support for bm1690e platform

• Introduced Lightstereo model support

Core Optimizations

• Implemented ppl-based codegen for bm1690 ops

• Added HdimIsBatch mm pattern cases

• Enhanced auto-round support for LLM models

• Improved AddPostprocess with additional information

• Added same_addr parameter in model_deploy

• CV184x now supports LSTM BF16 operations

New Functionality

• Added new rewriter pattern for optimization

• Implemented dynamic MLIR inference for more ops

• Improved qtable search functions with broader application

• Updated the calibration section of the manual

• Introduced quant.drawio documentation

⚡ Performance Improvements

• Optimized FAttention with keep_dims attribute

• Enhanced Qwen3VL with static ViT implementation

• Improved dynamic codegen for UpSample and other ops

• Refined shape pattern search for qtable optimization

• Strengthened Rope operation logic

🐛 Critical Fixes

Hardware Specific Fixes

• Fixed BM1688 GELU operation (now uses F32)

• Addressed CV184x GELU BF16 lowering

• Resolved BM1684 conv failure issues

• Fixed Mars3 global depthwise deconv bug

• Corrected CV184x interp input coord (now uses FP32/UINT32)

• Fixed TPUReshapeReorderPattern bug

Model Specific Fixes

• Fixed FAttention head slice error

• Fixed multi-deform_attn handling for a broader range of cases

• Resolved ppl integration issues on CV184x

• Fixed CV184x codegen issues

• Corrected bias_correction conv bug

Toolchain Improvements

• Fixed io placement in io_alone mode

• Addressed memory leaks in repeated invoke calls

• Resolved qtable optimization issue (MLIR-693)

• Fixed final qtable bugs

• Corrected tune_num=0 issue in search qtable

• Addressed tpulang no_save mode logging issues

• Fixed inplaceOp liveRange issues

• Fixed dynamic local codegen for ReduceOp

🛠️ Infrastructure Updates

Development Tools

• Added test cases for bm1690e ops and models

• Improved manual documentation

• Enhanced backend support for bm1684x/bm1688

• Removed deprecated 8ch interleave code

• Added shape parameter in set_tensor

• Fixed afterlayergroup pass no_save errors

Documentation

• Updated release records

• Added quant.drawio documentation

• Improved the calibration section of the manual

• Fixed various documentation issues
