Unclaimed project

Are you a maintainer of BladeDISC? Claim this project to take control of your public changelog and roadmap.

Claim this project

Changelog

BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

alibaba/BladeDISC

Mar 27, 2023

Compute intensive graph codegen

NV GPU

For CUDA platform, we have supported fusion of GEMM and its following element-wise ops (e.g. GELU, transpose), and then do codegen based on CUTLASS. Experiments show that this feature achieves up to 1.1x speedup for BERT model. Please set DISC_ENABLE_COMPUTE_INTENSIVE_FUSE=true if you want to try this feature.

AArch64

We introduced MLIR Transf...

Read full release & details

Dec 8, 2022

We released GPU AStitch optimization mainly last time in v0.2.0. Now we are proud to announce the release of BladeDISC v0.3.0.

Highlights

We have done the following things in the latest 6 months:

Initial support of PyTorch 2.0 compilation;
Contribute TorchToMHLO to [Torch-...

Read full release & details

May 11, 2022

Release 0.2.0

Performance Optimization

GPU stitch fusion

Make use of GPU shared memory to fuse reduce operator with its consumers into one kernel. It helps to accommodate complex memory-intensive computations (e.g., LayerNorm, SoftMax) into one kernel, reducing off-chip memory traffics and overhead of kernel scheduling and launching. It implements partial functions described in pap...

Read full release & details

Related Projects

mapbox-navigation-android

Mapbox Navigation SDK for Android

ToastFish

一个利用摸鱼时间背单词的软件。

barcodelib

C# Barcode Image Generation Library

JPProject.IdentityServer4.SSO

:lock: ASP.NET Core 3.1 Open Source SSO. Built within IdentityServer4 :key:

View all projects →

BladeDISC

BladeDISC 0.4.0

Compute intensive graph codegen

NV GPU

AArch64

BladeDISC 0.3.0: Announce PyTorch 2.0 Compilation Support

Highlights

BladeDISC 0.2.0

Release 0.2.0

Performance Optimization

GPU stitch fusion

Related Projects

mapbox-navigation-android

ToastFish

barcodelib

JPProject.IdentityServer4.SSO

BladeDISC 0.4.0

Compute intensive graph codegen

NV GPU

AArch64

BladeDISC 0.3.0: Announce PyTorch 2.0 Compilation Support

Highlights

BladeDISC 0.2.0

Release 0.2.0

Performance Optimization

GPU stitch fusion

Related Projects

mapbox-navigation-android

ToastFish

barcodelib

JPProject.IdentityServer4.SSO