New
v1.2.1
Added
- Support more graph optimizations: Convolution+Convolution, LayerNorm
- Support more operators: ROIAlign, GenerateProposals, Reciprocal, Not, Log, ReductionL2, InstanceNorm, Expand, Gather, Scatter
- Support NCHW input data for more operators (PReLU)
- Support ONNX weight sharing between Linear, MatMul, Gemm and Gather
- Support more networks on CPU: vision transformers (ViT, TNT), recommendation networks
- Support more networks on GPU: ASR, Faster_RCNN
- Support Armv7 int8 to accelerate NLP networks (50%+ speed-up)
- Support X86 AVX512 int8 to accelerate NLP networks (3x+ speed-up)
- Support using images on Qualcomm GPU and add GPU image management methods
- Improve inference performance on Qualcomm GPU
- Add more kit Android/iOS demos: Chinese ASR, Face Detection,