[OMPI users] oob mca question

Aaron Knister Thu, 12 Nov 2009 21:35:10 -0500

Dear List,

I'm having a really weird issue with Open MPI version 1.3.3 (version 1.2.8 doesn't seem to exhibit this behavior). Essentially, when I start jobs from the cluster front-end node using mpirun, mpirun sits idle for up to a minute and a half (for 30 nodes) before running the command I've given it. Running the same command on any other node in the cluster returns in a fraction of a second. Upon further research, it appears to be an issue with the way orted on the compute nodes attempts to talk back to the front-end node. When I launch mpirun from the front-end node, this is the process it spawns on the compute node (public IP scrambled for security purposes):

orted --daemonize -mca ess env -mca orte_ess_jobid 1816657920 -mca orte_ess_vpid 1 -mca orte_ess_num_procs 3 --hnp-uri 1816657920.0;tcp://130.X.X.X:56866;tcp://172.40.10.1:56866;tcp://172.20.10.1:56866

Adding some firewall debugging rules showed that the compute nodes were trying to talk back to mpirun on the front-end node over the front-end node's public IP. Based on this, and looking at the arguments passed above, it seems that the public IP of the front-end node was being tried before any of its private IPs, and that the delay I was seeing was orted waiting for the connection to the front-end node's public IP to time out before it tried the cluster-facing IP and the connection succeeded.
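
One way to check this hypothesis from a compute node is to try each address in the --hnp-uri string in the listed order and time every attempt. The following is a minimal Python sketch, not anything Open MPI itself runs: the helper names parse_hnp_uri and probe are hypothetical, and the URI layout (a process name followed by semicolon-separated tcp://host:port contacts) is inferred only from the command line shown above.

```python
import socket
import time


def parse_hnp_uri(uri):
    """Split '<name>;tcp://host:port;...' into (name, [(host, port), ...]).

    The layout is an assumption based on the orted command line in this post.
    """
    name, *contacts = uri.split(";")
    endpoints = []
    for contact in contacts:
        scheme, _, address = contact.partition("://")
        if scheme != "tcp":
            continue  # ignore anything that is not a TCP contact
        host, _, port = address.rpartition(":")
        endpoints.append((host, int(port)))
    return name, endpoints


def probe(endpoints, timeout=5.0):
    """Attempt a TCP connection to each endpoint in order and time it."""
    results = []
    for host, port in endpoints:
        start = time.monotonic()
        try:
            with socket.create_connection((host, port), timeout=timeout):
                status = "connected"
        except OSError as exc:  # covers timeouts, refusals, lookup failures
            status = f"failed: {exc}"
        results.append((host, port, status, time.monotonic() - start))
    return results
```

Calling probe(parse_hnp_uri(uri)[1]) on a compute node while mpirun is still waiting would show which address stalls; a long elapsed time on the first entry followed by a quick success on a later one would match the behavior described above.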

I was able to work around this by specifying "--mca oob_tcp_if_include bond0,eth0" to mpirun (the front-end node has 2 bonded NICs as its cluster-facing interface). When I provided that argument, the previously experienced delay disappeared. I could easily put that into openmpi-mca-params.conf and be done with the problem, but I would like to know why Open MPI chose to use the public IP of the node before its internal IP, and whether this is expected behavior. I suspect that it may not be.
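
For reference, the persistent form of that workaround would look roughly like the fragment below. This is a sketch assuming the usual "name = value" syntax of the MCA parameter file; the interface names are the ones from this cluster, and the file's location depends on the installation prefix (typically etc/openmpi-mca-params.conf under it).

```
# openmpi-mca-params.conf on the front-end node
# Restrict out-of-band (oob) TCP traffic to the cluster-facing interfaces.
oob_tcp_if_include = bond0,eth0
```

This only hides the symptom on this node; it does not answer why the public address is listed first.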


-Aaron
