模型接口类别 |
函数接口 |
---|---|
ML API |
def fit(dataset: Dataset[_]): KNNModel def transform(dataset: Dataset[_]): DataFrame |
Param name |
Type(s) |
Description |
---|---|---|
dataset |
Dataset[_] |
包含样本特征的DF |
k |
Int |
近邻数 |
Param name |
Type(s) |
Default |
Description |
---|---|---|---|
setFeaturesCol(value :String |
String |
features |
训练集特征列名 |
setAuxiliaryCols(valu e:Array[String]) |
Array[Str ing] |
Array.empty[St ring] |
训练集附加列列名 |
Param name |
Type(s) |
Default |
Description |
---|---|---|---|
setFeaturesCol(v alue: String) |
String |
features |
测试集特征列名 |
setNeighborsCo l(value:String) |
String |
neighbors |
邻居附加列列名 |
setDistanceCol( value: String) |
String |
distances |
邻居距离列名 |
setK(value: Int) |
Int |
1 |
近邻数 |
setTestBatchSiz e(value: Int) |
Int |
1024 |
搜索Batch大小 |
参数及fit代码接口示例:
val model = new KNN() .setFeaturesCol(featuresCol) .setAuxiliaryCols(Array("id")) .fit(trainDataDF)
Param name |
Type(s) |
Description |
---|---|---|
dataset |
Dataset[_] |
包含k近邻距离和附加列的DF |
● 使用样例
val model = new KNN() .setFeaturesCol(featuresCol) .setAuxiliaryCols(Array("id")) .fit(trainDataDF) val testResults = model .setFeaturesCol(featuresCol) .setNeighborsCol(neighborsCol) .setDistanceCol(distanceCol) .setK(k) .setTestBatchSize(testBatchSize) .transform(testDataDF)