John,
Any progress on this?
John Ouellette wrote:
Hi Tim,
Hmmmm, nope. I recompiled OpenMPI to produce the static libs, and even
recompiled my app statically, and received the same error messages.
If orted isn't starting on the compute nodes, is there any way I can debug
this to find out where it is failing?
You could try running orted standalone to verify that it starts up correctly:
bpsh <node> orted --debug
Actually, I just tried your suggestion of running ldd on one of the compute
nodes (should've tried this before recompiling, I guess...). I get a strange
error, which probably indicates a problem with bproc:
ldd /usr/local/intel-libs/9.0/bin/orted
/proc/self/fd/3: line 125: cat: command not found
I know I've run ldd on a node before without problems before.... I don't know
if this is related to my OpenMPI problems, but I'm going to have to look into
it.
Tim