Change Description

The OmniRuntime OmniOperator feature of Kunpeng BoostKit for Big Data uses a unified infrastructure to support different engines (such as Spark). This reduces repeated optimization work, makes full use of general-purpose and heterogeneous computing power, and promotes the Kunpeng ecosystem.

New Features

Improved the performance of the 99 TPC-DS benchmark queries by 30%. The optimizations include vectorized computation in the AVG and SUM aggregators, sort spilling triggered by memory usage, a shuffle write path that reduces spills to temporary files, and join reordering without the cost-based optimizer (CBO).
Added the TopNSort operator and the integration of Sort-Merge Join with Sort.
Added native table scan processing for Parquet files, and support for ORC and Parquet in security clusters.
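
For readers unfamiliar with the idea behind a TopN sort, the following is a minimal sketch of the general technique, not OmniOperator's native implementation. It keeps only the n best rows in a bounded min-heap instead of sorting the whole input, which is what makes an ORDER BY ... LIMIT n style query cheaper than a full sort. The function name top_n_sort and the (key, value) row layout are assumptions made for illustration only.

```python
import heapq
from typing import Iterable, List, Tuple

# Hypothetical row layout for illustration: (key, value).
Row = Tuple[str, int]


def top_n_sort(rows: Iterable[Row], n: int) -> List[Row]:
    """Return the n rows with the largest value, in descending order.

    Only n rows are held in memory at any time. Ties on value are
    resolved in favor of the row that appeared first in the input.
    """
    if n <= 0:
        return []

    # Min-heap of (value, -sequence, row). The smallest entry sits at
    # heap[0], so it is the one to evict when a better row arrives.
    heap: List[Tuple[int, int, Row]] = []
    for seq, row in enumerate(rows):
        entry = (row[1], -seq, row)
        if len(heap) < n:
            heapq.heappush(heap, entry)
        elif entry > heap[0]:
            heapq.heapreplace(heap, entry)

    # Final ordering touches only the n retained rows.
    return [row for _, _, row in sorted(heap, reverse=True)]


if __name__ == "__main__":
    sample = [("a", 5), ("b", 9), ("c", 1), ("d", 7), ("e", 9)]
    print(top_n_sort(sample, 3))
```

With m input rows, this approach costs roughly O(m log n) time and O(n) memory, compared with O(m log m) time and O(m) memory for a full sort followed by a limit.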

Modified Features

None

Removed Features

None

Parent topic: V1.3.0