Change Description
The OmniRuntime OmniOperator feature of Kunpeng BoostKit for Big Data uses a unified infrastructure to support different engines (such as Spark), reducing repeated optimization work, fully exploring common and heterogeneous computing power, and promoting the Kunpeng ecosystem.
New Features
- Improved the performance of the 99 TPC-DS benchmark queries by 30%. The optimizations include vectorized computing of AVG/SUM aggregators, sort spills based on memory usage, shuffle write for fewer spills in TMP files, and join reorder without CBO.
- Added TopNSort operators and Sort-Merge Join and Sort integration.
- Added the table scan native processing for Parquet files, and security clusters for ORC and Parquet.
Modified Features
None
Removed Features
None
Parent topic: V1.3.0