On Feb 14, 2007, at 12:28 PM, Mark Kosmowski wrote:

> Everything is working properly now.  I needed to reinstall Linux on
> one of my nodes after a botched attempt at a network install - mpirun
> ... hostname worked, but my application hung and gave a connect()
> errno 110.
>
> At this point I decided to give up and try mpich instead.  During the
> mpich sanity checking, there was a more verbose error message
> regarding the failed node, so I reinstalled the OS, reconfigured my
> environment variables for OpenMPI and everything is now working.

Blah.  We definitely need to work on our error messages.

FWIW, what did MPICH say for the error?

--
Jeff Squyres
Server Virtualization Business Unit
Cisco Systems
