On May 22, 2008, at 12:52 PM, Jim Kusznir wrote:

I installed openmpi 1.2.6 on my system, but now my users are
complaining about even more errors.  I'm getting this:

[compute-0-23.local:26164] [NO-NAME] ORTE_ERROR_LOG: Not found in file
runtime/orte_init_stage1.c at line 182
--------------------------------------------------------------------------
Sorry!  You were supposed to get help about:
   orte_init:startup:internal-failure
from the file:
   help-orte-runtime
But I couldn't find any file matching that name.  Sorry!
--------------------------------------------------------------------------

Everything below this message is a consequence of the first message (above).

There's two problems here:

1. Where are the help files -- why can't OMPI find them? That's really weird; it suggests a broken Open MPI install. You have a few pending e-mails to me about RPM builds that I need to go read (I'm sorry; I'm way backed up :-( ); I wonder if this is somehow related...?

2. The specific error that is occurring is that the ORTE layer in OMPI is unable to initialize its out-of-band messaging system (we call it the "RML") which is *really* weird. The only reason that I can think that that would occur is a broken OMPI install.

Is there any chance that there are some files missing from your OMPI installs? For example, do you see these two files under $prefix/lib/ openmpi (or wherever $pkglibdir was set to):

mca_rml_oob.la*
mca_rml_oob.so*

--
Jeff Squyres
Cisco Systems

Reply via email to