Introduction
Product Description
BulkLoad imports large volumes of data into HBase efficiently.
Related Concepts
HBase
HBase is an open-source, non-relational, distributed database modeled after Google's Bigtable and written in Java. As an Apache Software Foundation project in the Hadoop ecosystem, it runs on top of HDFS and provides Bigtable-like capabilities for Hadoop. HBase offers fault-tolerant storage for large volumes of sparse data.
Principles
HBase BulkLoad works in two steps:
- Prepare the data by running MapReduce tasks that convert it into HFiles, the storage file format of HBase.
- Load the prepared files into HBase.
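The two steps above can be pictured with a toy, in-memory model. The sketch below does not use the HBase API: `prepare`, `load`, and `REGION_SPLITS` are hypothetical names chosen for illustration, and plain Python lists stand in for files. It assumes that the prepare step groups records by target region and sorts them by row key, and that the load step hands over each prepared file as a whole instead of writing records one at a time.

```python
from bisect import bisect_right

# Hypothetical region boundaries (an assumption for this sketch):
# row keys < "g" go to region 0, "g" <= key < "p" to region 1, the rest to region 2.
REGION_SPLITS = ["g", "p"]


def prepare(records, splits):
    """Step 1: bucket records by region and sort each bucket by row key.

    In a real cluster this work is distributed across MapReduce tasks;
    here each sorted list stands in for one prepared file.
    """
    files = [[] for _ in range(len(splits) + 1)]
    for row_key, value in records:
        files[bisect_right(splits, row_key)].append((row_key, value))
    return [sorted(bucket) for bucket in files]


def load(table, files):
    """Step 2: attach each non-empty prepared file to its region as one unit."""
    for region_id, prepared_file in enumerate(files):
        if prepared_file:
            table.setdefault(region_id, []).append(prepared_file)
    return table


records = [("mango", 3), ("apple", 1), ("zebra", 5), ("grape", 2), ("banana", 4)]
files = prepare(records, REGION_SPLITS)
table = load({}, files)
print(table)
```

Because each prepared file is already sorted and matches one region's key range, the load step in this model only has to attach files, which is the intuition behind why importing data in bulk is cheaper than issuing individual writes.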