Raymond Wan wrote:
> 
> Hi Jeff,
> 
> Some "good" news (but still some bad news).  Y and Z are part of a set
> of 8 machines and I found out that mpirun works for one of them.  I
> didn't checked a couple of them before -- sorry!  However, I'm no closer
> to the solution since all 8 should be "identical", according to our
> sysadmin.  He said the only difference (that he can think of) between
> the working one and all the others is that the working one has an NIS
> server installed.  It is the NIS server for the cluster (presumably, the
> others run a client version).  Could that be the reason?  He can't think
> of anything else that distinguishes between them but he says it is
> possible that the NIS server is correctly configured for what we use it
> for, but not for what I'm doing with Open MPI -- he doesn't know what
> should be done, though.

In an earlier e-mail in this thread, I theorized that this might be a
problem with your name service. This latest information seems to support
that theory.

To test, on all 3 systems, use the 'host' command to see if you can
resolve the hostnames of all the 3 systems.

On host X, do this:

host X
host Y
host Z

Then do the same on hosts Y and Z.

If the 'host' command can resolve properly, you should see something
like this:

$ host foo
foo.example.com has address 192.168.1.1

If 'host' can't resolve  a hostname properly, you should see something
like this:

$ host bar
Host bar not found: 3(NXDOMAIN)

OpenMPI should be using the same nameservice libraries all the other
programs use, so I find it hard to believe everything *but* OpenMPI is
working propery, but I suppose it could be possible. I've seen weirder.


-- 
Prentice

Reply via email to