Dataset Description

Datasets for algorithm performance tests can be obtained in two ways: generate the required machine learning datasets with HiBench, or download datasets from the official website and preprocess them before running the tests.

Table 1 Data generation methods

| Method | Datasets |
| --- | --- |
| Generating datasets using HiBench | CP10M1K, CP2M5K, ALS, D200M100, D10M4096, HiBench_10M_200M, HibenchRating3wx3w, ECBDL14, BostonHousing, Titanic, avazu, Movielens, Taobao, Criteo40M, Criteo150M, bremenSmall, farm, house |
| Downloading datasets from the official website | house, HIGGS, nytimes, Kosarak, DEEP1B, Mnist8m, Epsilon, MESH_DEFORM |
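
When scripting a test run, it can help to look up how a given dataset is obtained. The following is a minimal sketch that encodes Table 1 as a Python dictionary. The dataset names are copied from the table; the keys `hibench` and `download` and the helper `methods_for` are hypothetical names chosen for illustration and are not part of HiBench or any other tool.

```python
from typing import Dict, List

# Table 1 as a lookup structure. Dataset names come from the table;
# the method keys "hibench" and "download" are illustrative labels only.
DATASETS_BY_METHOD: Dict[str, List[str]] = {
    "hibench": [
        "CP10M1K", "CP2M5K", "ALS", "D200M100", "D10M4096",
        "HiBench_10M_200M", "HibenchRating3wx3w", "ECBDL14",
        "BostonHousing", "Titanic", "avazu", "Movielens", "Taobao",
        "Criteo40M", "Criteo150M", "bremenSmall", "farm", "house",
    ],
    "download": [
        "house", "HIGGS", "nytimes", "Kosarak", "DEEP1B", "Mnist8m",
        "Epsilon", "MESH_DEFORM",
    ],
}


def methods_for(dataset: str) -> List[str]:
    """Return every method under which Table 1 lists the dataset.

    The comparison is case-insensitive. A dataset listed in both rows
    yields both methods; an unlisted dataset yields an empty list.
    """
    wanted = dataset.lower()
    return [
        method
        for method, names in DATASETS_BY_METHOD.items()
        if any(name.lower() == wanted for name in names)
    ]


if __name__ == "__main__":
    for name in ("HIGGS", "Titanic", "house"):
        print(name, methods_for(name))
```

The helper returns a list rather than a single value because a dataset name can appear in both rows of the table.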