Feature List
Type |
Name |
Description |
Constraints |
|
|---|---|---|---|---|
Recall algorithm |
KScaNN |
An inverted index-based vector retrieval algorithm that deeply optimizes index layout, algorithmic logic, and computing process to fully unlock the chip potential. |
|
|
KBest |
Optimizes the performance and precision of the nearest neighbor search by using methods such as quantization and NUMA scheduling, which are used for multi-dimensional vector approximate nearest neighbor search. |
|
||
KVecTurbo |
Quantifies and compresses high-dimensional vectors to quickly obtain the near neighbors of a query. In addition, KVecTurbo uses SIMD instructions to accelerate distance calculation for multidimensional vector nearest neighbor search. |
|
||
KRL |
Kunpeng Retrieval Library (KRL) is an operator library optimized for the Kunpeng platform to accelerate vector retrieval. KRL can accelerate Faiss-supported algorithms such as HNSW, PQFS, IVFPQ, and IVFPQFS by replacing operators. |
|
||
KNewPfordelta |
Kunpeng New PForDelta algorithm is a vectorized decompression algorithm that optimizes inverted index processing for superior search performance. |
|
||
Faiss |
The open-source Faiss algorithm library has been deeply optimized using key technologies such as vectorization, dimension-interleaved lookup and accumulation, and vector filtering and compression. In addition, FP16 interface support has been added for the hnsw algorithm. These enhancements significantly improve the similarity search and clustering efficiency across IVFFlat, IVFPQ, HNSW, PQFS, and IVFPQFS indexing algorithms. |
|
||
RaBitQ |
Based on the open-source RaBitQ code, the library is extended with Arm64 (AArch64) support, introducing FP16 precision optimization, NEON SIMD vectorization, assembly-level Lookup Table (LUT) acceleration, Spilling with Orthogonality-Amplified Residuals (SOAR) vector allocation, and ML-based adaptive nprobe. |
|
||
EmbeddingLookup |
Based on the open-source Monolith large-scale real-time recommendation system, its core Embedding Lookup module has been deeply adapted and optimized. |
|
||
hnswlib |
The open-source hnswlib has been deeply optimized for the Arm architecture. It delivers efficient FP16 support through vectorization, and leverages optimization policies such as prefetching and instruction rescheduling. |
|
||
Ranking-focused AI library |
KDNN |
An acceleration operator library used for the AI framework. |
|
|
KDNN_EXT |
Use the Cython framework provide Python interfaces, making it more suitable for user scenarios. |
|
||
KTFOP |
A core operator library for TensorFlow. |
|
||
TensorFlow Serving thread scheduling optimization |
The TensorFlow Serving thread scheduling optimization feature improves the TensorFlow operator scheduling algorithm and adds other thread management optimizations, effectively improving the model inference throughput in high-concurrency scenarios. |
|
||
ANNC |
TensorFlow leverages the Accelerated Neural Network Compiler (ANNC) to perform graph-level optimizations, enhancing inference performance in recommendation systems. ANNC provides optimization technologies including computational graph optimization, and generation and integration of high-performance fused operators. |
|
||
The test results of the preceding algorithm features and performance metrics are based on the OS and compiler versions listed in the preceding table. The performance in other OSs or compiler environments is not verified.