In the meantime, I’ve created a patch that should address this problem:

https://github.com/open-mpi/ompi-release/pull/227

If you are able and willing, please check whether it resolves this for you.


> On Apr 7, 2015, at 9:29 PM, Ralph Castain <r...@open-mpi.org> wrote:
> 
> Hmmm…yeah, we’ve been discussing this point. It’s a bit of a mixed bag. We 
> hit problems where people don’t get their paths set correctly on remote 
> machines, and then we hang because of bad connections between incompatible 
> versions. At the same time, we hit situations like this one.
> 
> We’re getting ready to release 1.8.5 - let me discuss with the team what we 
> can/should do to resolve these problems.
> 
> 
>> On Apr 7, 2015, at 8:43 PM, Alan Wild <a...@madllama.net> wrote:
>> 
>> I know this isn't "recommended", but a vendor recently gave me an executable 
>> compiled against openmpi-1.8.3, and I happened to have recently completed a 
>> build of 1.8.4 (but didn't have 1.8.3 sitting around, and the vendor refuses 
>> to provide his build).
>> 
>> Since these releases are so close they should be ABI compatible so I thought 
>> I would see what happens...
>> 
>> [arwild1@hplcslsp2 ~]$ mpirun -n 2 -H localhost vendor_app_mpi
>> [hplcslsp2:11394] [[56032,0],0] tcp_peer_recv_connect_ack: received 
>> different version from [[56032,1],0]: 1.8.3 instead of 1.8.4
>> [hplcslsp2:11394] [[56032,0],0] tcp_peer_recv_connect_ack: received 
>> different version from [[56032,1],1]: 1.8.3 instead of 1.8.4
>> 
>> and then everything hangs.  I can clearly see the output coming from 
>> 
>> ./orte/mca/oob/tcp/oob_tcp_connection.c
>> 
>> and where it returns
>> 
>> return ORTE_ERR_CONNECTION_REFUSED;
>> 
>> 
>> So it looks like I'm going to have to at least build 1.8.3, but is there any 
>> way to work around this given we are dealing with builds that are that 
>> close?  I'm really not interested in "rolling back" to 1.8.3 or providing 
>> both releases on my system.  
>> 
>> (yes, "right answer" is to get the vendor to provide his build... long story)
>> 
>> -Alan
>> 
>> 
>> 
>> -- 
>> a...@madllama.net http://humbleville.blogspot.com
>> _______________________________________________
>> users mailing list
>> us...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>> Link to this post: 
>> http://www.open-mpi.org/community/lists/users/2015/04/26645.php
> 
