Architecture
Kunpeng BoostKit for Big Data supports multiple big data platforms and application scenarios such as offline analysis, real-time search, and real-time stream processing.
Real-time search indicates querying a large amount of real-time written data based on primary index keys in real time. The query has high requirements on response time whereas the query conditions are relatively simple. If the query terms are complicated, search for the primary index keys using keywords in all-domain data and then use the primary index keys for query. All-domain data includes structured and text data. Its typical features are as follows:
- High requirements for millisecond-level query response time
- High concurrency
- Up to petabytes of data to be processed
- Simultaneous processing of structured and unstructured data
- Full-text search
- Near-real-time index
Figure 1 shows the system architecture of real-time search.
|
Name |
Description |
|---|---|
|
Data source |
The data source types include file data (such as TXT and CSV) and stream data (such as Socket flows and OGG log flows). |
|
Data collection system |
|
|
Real-time search engine |
|
|
Service applications |
Real-time search applications developed by ISVs based on Elasticsearch, HBase APIs, and RESTful APIs. |
