On Mar 13, 2009, at 6:17 AM, Raymond Wan wrote:

What doesn't work is:

[On Y] mpirun --host Y,Z --np 2 uname -a
[On Y] mpirun --host X,Y,Z --np 3 uname -a

...and similarly for machine Z. I can confirm that from any of the 3 machines, I can ssh to the other without typing in a password. I set up the RSA keys correctly [I think]. When I run the above commands, it just hangs. Adding "--verbose" doesn't produce any information...I don't know what it's doing. I had a longer running program than "uname" and I didn't see it appear on any of the machines. In fact [since it hangs], I don't see uname on "top", either. I do, however, see "mpirun" and "orted" on top, though.

I guess some setup is missing that X has that the other two do not have. Any suggestions on how to find out the cause of this problem? Thank you!



Do you see "rsh" or "ssh" in the output of "ps -eadf" when mpirun is hanging, perchance? If you, what happens if you copy-n-paste those command lines and run them manually?

--
Jeff Squyres
Cisco Systems

Reply via email to