Rate This Document
Findability
Accuracy
Completeness
Readability

Change Description

The OmniRuntime OmniOperator feature of Kunpeng BoostKit for Big Data uses a unified infrastructure to support different engines (such as Spark), reducing repeated optimization work, fully exploring common and heterogeneous computing power, and promoting the Kunpeng ecosystem.

New Features

  • Improved the performance of the 99 TPC-DS benchmark queries by 30%. The optimizations include vectorized computing of AVG/SUM aggregators, sort spills based on memory usage, shuffle write for fewer spills in TMP files, and join reorder without CBO.
  • Added TopNSort operators and Sort-Merge Join and Sort integration.
  • Added the table scan native processing for Parquet files, and security clusters for ORC and Parquet.

Modified Features

None

Removed Features

None