Rate This Document
Findability
Accuracy
Completeness
Readability

Installing the Software (on openEuler 22.03)

To install OmniMV on openEuler 22.03, install Conda and create an OmniMV virtual environment. Then use pip to download the Python dependency library, including the dependency packages of the Spark and ClickHouse engines. After that, download and decompress the OmniMV recommendation algorithm software package, and remove other user groups' read and execute permissions.

  1. Download the Conda software package and activate the Conda virtual environment.
    1. Download Conda.
      1
      2
      cd /opt
      wget -k https://repo.anaconda.com/archive/Anaconda3-2022.05-Linux-aarch64.sh
      
    2. Install Conda, for example, in the /opt/anaconda3 directory.
      1
      bash Anaconda3-2022.05-Linux-aarch64.sh
      
    3. Initialize Conda.
      1
      source /opt/anaconda3/etc/profile.d/conda.sh
      
    4. Create a Conda virtual environment named omnimv.
      1
      conda create -n omnimv python=3.10
      
    5. Activate the Conda virtual environment.
      1
      conda activate omnimv
      
  2. Use pip to download the Python dependencies.
    • Spark dependency packages
       1
       2
       3
       4
       5
       6
       7
       8
       9
      10
      pip install Cython==0.29.24
      pip install numpy==1.22.3
      pip install pandas==1.3.5
      pip install PyYAML==6.0
      pip install mo-sql-parsing
      pip install sql-metadata==2.6.0
      pip install scikit-learn==1.1.2
      pip install xgboost==1.7.3
      pip install pyspark==3.1.1
      pip install pydoop==2.0.0
      
    • ClickHouse engine dependency packages
      1
      2
      3
      4
      5
      6
      7
      8
      9
      pip install Cython==0.29.24
      pip install numpy==1.22.3
      pip install pandas==1.3.5
      pip install PyYAML==6.0
      pip install mo-sql-parsing
      pip install sql-metadata==2.6.0
      pip install clickhouse-driver==0.2.4
      pip install scikit-learn==1.1.2
      pip install xgboost==1.7.3
      
  3. Download the OmniMV recommendation algorithm software package.
    • Spark: Download the OmniMV recommendation algorithm software package, and save the JAR package of the Spark SQL plugin and log parser JAR package to the server node of the Spark cluster. For details about how to obtain the software packages, see Obtaining Software. There is no special requirement on the software package paths. For example, you can store them in /opt/omnimv.
      1
      2
      3
      4
      mkdir -p /opt/omnimv
      mv boostkit-omnimv-spark-3.1.1-1.1.0-aarch64.jar /opt/omnimv
      mv boostkit-omnimv-logparser-spark-3.1.1-1.1.0-aarch64.jar /opt/omnimv
      mv BoostKit-omnimv_1.1.0.zip /opt/omnimv
      
    • ClickHouse engine:
      1
      2
      mkdir -p /opt/omnimv
      mv BoostKit-omnimv_1.1.0.zip /opt/omnimv
      
  4. Decompress the OmniMV component package and remove the read and executable permissions of other user groups on the generated folder.
    1
    2
    3
    cd /opt/omnimv
    unzip BoostKit-omnimv_1.1.0.zip
    chmod -R o-r BoostKit-omnimv_1.1.0
    

    You are advised to decompress the package as a non-root user so that the generated directory has the minimum permission to prevent files from being replaced.