The messages about the daemons is coming from two different sources. Grid is
saying it was able to spawn the orted - then the orted is saying it doesn't
know how to communicate and fails.
I think the root of the problem lies in the plm output that shows the qrsh it
will use to start the job. Fo
Hi John,
We have Univa grid 8.6.7
I will get information about the network from our IT and respond back. The grid
has 1000+ nodes.
Regards,
Vipul
From: users [mailto:users-boun...@lists.open-mpi.org] On Behalf Of John Hearns
via users
Sent: Saturday, May 30, 2020 2:19 AM
To: Open MPI Users
Hi Ralph,
Thanks for your response.
I added the option "--mca plm_rsh_no_tree_spawn 1" to mpirun command line, but
I get a similar error. (pasted below).
Regards,
Vipul
Got 14 slots.
tmpdir is /tmp/194954128.1.all.q
pe_hostfile is /var/spool/sge/has2/active_jobs/194954128.1/pe_hostfile
has2.or