Rate This Document
Findability
Accuracy
Completeness
Readability

Setting Up the Development Environment

Client Environment

Table 1 describes the client environment requirements.

Table 1 Client environment requirements

Item

Version

Remarks

OS

Windows 7 or later

Prepare it in advance.

Installing JDK

Install BiSheng JDK 1.8.0_342.

See Creating a Project.

Installing and configuring the development tool

Eclipse or IntelliJ IDEA is recommended. This document uses IntelliJ IDEA (2018.2) as an example.

Prepare it in advance.

Installing Scala

Complete the basic configuration for the Scala environment. Scala 2.12.11 is recommended for Spark 3.3.1.

See Creating a Project.

Installing Maven

Compile the project package. Recommended version: 3.6.3.

See Creating a Project.

Obtaining the Software

Table 2 describes how to obtain the library package of a machine learning algorithm.

Table 2 How to obtain the library packages

Applicable Spark Version

Software Package and URL

Remarks

3.3.1

Contact Huawei technical support.

Before using the software package, read and agree to Kunpeng BoostKit User License Agreement 2.0.

boostkit-ml-acc_2.12-3.0.0-spark3.3.1.jar

boostkit-ml-core_2.12-3.0.0-spark3.3.1.jar

boostkit-ml-kernel-client_2.12-3.0.0-spark3.3.1.jar

  • For details about how to compile the packages, see Compiling the Code in the Big Data Machine Learning Algorithm Library Feature Guide.
  • boostkit-ml-acc_2.XX-XXX-sparkXX.jar

    It is required for software running and must be deployed.

  • boostkit-ml-core_2.XX-XXX-sparkXX.jar

    It is required for software running and must be deployed.

  • boostkit-ml-kernel-client_2.XX-XXX-sparkXX.jar

    It is required for software compilation and does not need to be deployed.

Verifying Software Package Integrity

After downloading a software package from the Kunpeng community, verify the software package to ensure that it is consistent with the original one on the website.

Verify a software package as follows:

  1. Obtain the digital certificate and software.
  2. Obtain the verification tool and method from the following link:
  3. Verify the package integrity by following the procedure described in the OpenPGP Signature Verification Guide obtained from the URL.

Cluster Environment

Prepare the required cluster environment before algorithm development. Table 3 lists the required software versions.
Table 3 Recommended software configurations in the cluster environment

Item

Node Type

Requirement

OS

All nodes

openEuler 22.03 LTS SP1

JDK

All nodes

BiSheng JDK 1.8.0_342

ZooKeeper

Compute node

3.6.2

Hadoop

All nodes

3.2.0

Spark

All nodes

Spark 3.3.1

The Kunpeng algorithm library is compatible with Spark 3.3.1. Other platforms are not verified. For security purposes, you are advised to use a later version.