v4.0
v4.0 is a significant release with new features and improvements. AVX-512 sorting methods gain up to 2x perf improvements and we have added AVX2 sorting methods to support a wider range of x86 processors. In additional to using it as a header file library, x86-simd-sort can be installed as a library, and it provides API access to various sorting methods with automatic runtime dispatch to select the fastest version based on the processor. Here is a quick summary of all the changes:
- Added AVX2 implementations of
avx2_qsort,avx2_qselectandavx2_partial_qsortfor 32-bit and 64-bit data types. When compared tostd::sort, these are up to 12x faster for 32-bit data and up to 7x faster for 64-bit data. - x86-simd-sort can now be built and installed as a shared library. The library provides runtime dispatch and automatically picks the fastest version among AVX-512/AVX2/scalar depending on the processor it is run on. Starting with
clearlinuxv40270, you can install x86-simd-sort with .