用户集群环境配置问题导致mpirun命令在集群环境中运行失败。
运行失败示例如下:
$ mpirun ~/AllReduce -------------------------------------------------------------------------- Sorry! You were supposed to get help about: opal_init:startup:internal-failure But I couldn't open the help file: /usr1/workspace/Version_pipeline_ompi_aarch64_gcc10.3.1_CentOS7.6_MLX4.9/ompi/build/../share/openmpi/help-opal-runtime.txt: No such file or directory. Sorry! --------------------------------------------------------------------------
“.bashrc”文件未配置“OPAL_PREFIX”环境变量。