Howard,
I pasted below, the error message after a while of the hang I referred.
Regards,
Martín
-
A request has timed out and will therefore fail:
Operation: LOOKUP: orted/pmix/pmix_server_pub.c:345
Your job may terminate as a result of this problem. You may want to
adjust the MCA paramete
Hi Howard.
Thanks for the track in Github. I have run with mpirun without “master” in the
hostfile and runs ok. The hanging occurs when I run like a singleton (no
mpirun) which is the way I need to run. If I make a top in both machines the
processes are correctly mapped but hangued. Seems the M
Hi Martin,
I opened an issue on Open MPI's github to track this
https://github.com/open-mpi/ompi/issues/8005
You may be seeing another problem if you removed master from the host file.
Could you add the --debug-daemons option to the mpirun and post the output?
Howard
Am Di., 11. Aug. 2020 um 1