update 2: (its like I am talking to myself ... :) must start using
decaf ...)
Joe Landman wrote:
Joe Landman wrote:
[...]
ok, fixed this. Turns out we have ipoib going, and one adapter needed
to be brought down and back up. Now the tcp version appears to be
running, though I do get the
Joe Landman wrote:
3) using btl to turn off sm and openib, generates lots of these messages:
[c1-8][0,1,4][btl_tcp_endpoint.c:572:mca_btl_tcp_endpoint_complete_connect]
connect() failed with errno=113
[...]
No route to host at -e line 1.
This is wrong, all the nodes are visible from all
Hi folks:
This is a deeper dive into the code that was giving me fits over the
last two weeks.
It uses MPI_Waitsome and MPI_Allstart to launch/monitor progress.
More on that in a moment.
The testing I have done to date on this platform suggests that
OpenMPI is working fine, though I