Failed to Exit a Parallel HPC Application Debugging Task
Symptom
A parallel HPC application debugging task fails to exit, as shown in the following figures.
Figure 1 Environment cleanup failure
Figure 2 Excessive wait time while clearing the debugging environment
Figure 3 Failure to exit MPI debugging
Possible Causes
- Too many ranks are started.
- The network connectivity is poor.
Procedure
- (Optional) Manually delete the files in the displayed path (represented by xxx in the following command).
rm -f xxx
- Release process resources.
- Check the mpirun process.
ps -ef | grep mpirun
- Terminate the process. Replace {pid} with the process ID returned by the previous command.
kill -15 {pid}
- Check the mpirun process again to confirm that it has exited.
- Restart the services.
systemctl restart gunicorn_framework.service
systemctl restart gunicorn_plugin.service
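The mpirun check and kill steps in this procedure can be scripted when several mpirun processes are left behind. The following is a minimal sketch, not part of the tool: mpirun_pids is a hypothetical helper name, and the sketch assumes the usual Linux ps -ef column layout, where the PID is column 2 and the command starts at column 8.

```shell
#!/bin/sh
# mpirun_pids: hypothetical helper, not part of any debugging tool.
# Reads `ps -ef` output on stdin and prints the PID (column 2) of every
# line whose command (column 8) is mpirun, with or without a path prefix.
# Matching on the command column skips the `grep mpirun` line itself.
mpirun_pids() {
    awk '$8 ~ /(^|\/)mpirun$/ { print $2 }'
}

# Example usage (commented out so that sourcing this file kills nothing):
#   for pid in $(ps -ef | mpirun_pids); do
#       kill -15 "$pid"    # SIGTERM, as in the procedure
#   done
#   ps -ef | mpirun_pids   # no output means every mpirun process has exited
```

Matching on the command column rather than on the whole line avoids terminating unrelated processes that only mention mpirun in their arguments.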