v1.4.2
This release brings significant performance improvements, expanded SIMD support, and better cross-platform compatibility.

New Features
AVX-512 SIMD Support
- Added an AVX-512 search path for patterns up to 64 bytes
- Automatic detection and utilization of AVX-512 instructions on supported CPUs
- Graceful fallback to AVX2/SSE4.2 on older hardware

Enhanced Memory Performance
- Added prefetching (__builtin_prefetch) in search functions for better cache utilization
- Reduced MIN_CHUNK_SIZE to 2MB for improved parallelism on multi-core systems
- Added compiler optimization hints (LIKELY/UNLIKELY, HOT_FUNCTION)

Thread Pool Improvements
- Adaptive mutex using PTHREAD_MUTEX_ADAPTIVE_NP where available
- Reduced thread stack size to 256KB for lower memory overhead
- Added batch task submission for improved efficiency
- Smarter thread count selection (cores - 1 for system headroom)
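
For readers who want to see roughly what the thread pool settings above involve, here is a minimal sketch in C. It is not the project's actual code: the helper names (pick_worker_count, default_worker_count, init_pool_mutex, init_worker_attr) and the WORKER_STACK_SIZE macro are hypothetical, and the use of sysconf(_SC_NPROCESSORS_ONLN) to count cores is an assumption. On glibc, PTHREAD_MUTEX_ADAPTIVE_NP is an enum constant rather than a macro, so the sketch tests __GLIBC__ instead of using #ifdef on the constant itself.

```c
#include <assert.h>
#include <pthread.h>
#include <stddef.h>
#include <unistd.h>

/* 256KB worker stack, matching the figure in the notes (macro name is hypothetical). */
#define WORKER_STACK_SIZE ((size_t)256 * 1024)

/* Hypothetical helper: leave one core free for the system, never go below 1. */
long pick_worker_count(long online_cores)
{
    return online_cores > 1 ? online_cores - 1 : 1;
}

/* Assumption: the core count comes from sysconf(); it returns -1 on failure,
 * which pick_worker_count() maps to a single worker. */
long default_worker_count(void)
{
    return pick_worker_count(sysconf(_SC_NPROCESSORS_ONLN));
}

/* Hypothetical helper: adaptive mutex on glibc, default mutex elsewhere. */
int init_pool_mutex(pthread_mutex_t *m)
{
    pthread_mutexattr_t attr;
    int rc;

    assert(m != NULL);
    rc = pthread_mutexattr_init(&attr);
    if (rc != 0)
        return rc;
#if defined(__GLIBC__)
    /* Spins briefly before sleeping; useful for short critical sections. */
    rc = pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_ADAPTIVE_NP);
#endif
    if (rc == 0)
        rc = pthread_mutex_init(m, &attr);
    pthread_mutexattr_destroy(&attr);
    return rc;
}

/* Hypothetical helper: prepare thread attributes with the reduced stack size.
 * The caller passes the result to pthread_create() and later destroys it. */
int init_worker_attr(pthread_attr_t *attr)
{
    int rc;

    assert(attr != NULL);
    rc = pthread_attr_init(attr);
    if (rc != 0)
        return rc;
    rc = pthread_attr_setstacksize(attr, WORKER_STACK_SIZE);
    if (rc != 0)
        pthread_attr_destroy(attr);
    return rc;
}
```

A pool following this sketch would call default_worker_count() once at startup, initialize its queue lock with init_pool_mutex(), and pass the attributes from init_worker_attr() to each pthread_create() call. A smaller stack lowers per-thread memory use, but workers that recurse deeply or keep large buffers on the stack may need more than 256KB.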