Hi Tim, Hmmmm, nope. I recompiled OpenMPI to produce the static libs, and even recompiled my app statically, and received the same error messages.
If orted isn't starting on the compute nodes, is there any way I can debug this to find out where it is failing? Actually, I just tried your suggestion of running ldd on one of the compute nodes (should've tried this before recompiling, I guess...). I get a strange error, which probably indicates a problem with bproc: ldd /usr/local/intel-libs/9.0/bin/orted /proc/self/fd/3: line 125: cat: command not found I know I've run ldd on a node before without problems before.... I don't know if this is related to my OpenMPI problems, but I'm going to have to look into it. John >Hello John, > >You need to specify both --enable-static and --disable-shared to do a static >build (not sure why, perhaps someone else can fill us in on that)... > >The logs indicate the launch is failing trying to start orted on the backend >node... probably due to shared library dependencies. > >You might try doing a bpsh <node> ldd orted > >And check that the libraries resolve / and or rebuild with the indicated >configure option. >Regards, >Tim > -- +++++++++++++++++++++++++++++++++++ John Ouellette Department of Astrophysics American Museum of Natural History Ph: 212-313-7919 Fax: 212-769-5007