Jeff Squyres (jsquyres) wrote on 24/06/16 13:43:
Nikolay --
Thanks for all the detail! That helps a tremendous amount.
Open MPI actually uses IP networks in *two* ways:
1. for command and control
2. for MPI communications
Your use of btl_tcp_if_include regulates #2, but not #1 -- you need to add
another MCA param to regulate #1. Try this:
mpirun --mca btl_tcp_if_include venet0:0 --mca oob_tcp_if_include venet0:0
...
> See if that works.
Jeff, thanks a lot for such prompt reply, detailed explanation and suggestion! But unfortunately the
error is still the same:
[user@ct110 hello]$ /opt/openmpi/1.10.3/bin/mpirun --mca btl self,tcp --mca btl_tcp_if_include
venet0:0 --mca oob_tcp_if_include venet0:0 -np 1 -host 10.0.50.41 hostname
[ct111.domain.org:01054] [[12888,0],1] tcp_peer_send_blocking: send() to socket 9 failed: Broken
pipe (32)
[...snip...]