v1.2.0
Gazelle 1.2.0 is the second release from the oap-project repository (https://github.com/oap-project/gazelle_plugin). In this release, we implemented 11 new features, fixed 11 performance issues and bugs, and merged a total of 48 PRs.
Here are the major highlights of Gazelle 1.2.0:
- 1.25X speedup on the 103 TPC-DS queries
- Add RDD cache support
- Add spill and UDF support
- Implement native column-to-row optimization
- Further improve performance stability in many fallback cases
Gazelle Plugin
Features
|No.|Description|
|:---|:---|
|#394|Support ColumnarArrowEvalPython operator|
|#368|Encountered Hadoop version (3.2.1) conflict issue on AWS EMR-6.3.0|
||Implement a series of datetime functions|
||Add Date/Timestamp type support|
||Make arrow-unsafe allocator the default|
||Configurable codegen opt level|
||Arrow Data Source: CSV format support fix|
||Add Parquet write support to Arrow data source|
||Add build option to enable unsafe Arrow allocator|
||UDF: Add test case for validating basic row-based UDF|
||Update Scala unit test to spark-3.1.1|