Re: [OMPI users] Can't start program across network -- solved!

2009-03-17 Thread Raymond Wan
Hi Prentice/Jeff, Prentice Bisbal wrote: In an earlier e-mail in this thread, I theorized that this might be a problem with your name service. This latest information seems to support that theory. Thank you very much for the suggestions and help! After discussing with our system administra
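For anyone hitting the same symptom, a quick way to test the name-service theory is to check whether each host name resolves from the machine where mpirun is launched. The following is only a minimal sketch, not something taken from the thread: the function name `resolve_hosts` and the `resolver` parameter are made up for illustration, and X, Y, and Z stand in for the real host names.

```python
import socket


def resolve_hosts(hosts, resolver=socket.gethostbyname):
    """Map each host name to its resolved IPv4 address, or None on failure.

    `resolver` is a hypothetical hook (not part of any Open MPI tool) that
    lets the lookup be replaced, e.g. for testing without a network.
    """
    results = {}
    for host in hosts:
        try:
            results[host] = resolver(host)
        except OSError:
            # socket.gaierror and socket.herror are subclasses of OSError,
            # so a failed lookup lands here.
            results[host] = None
    return results
```

Running `resolve_hosts(["X", "Y", "Z"])` on each machine in turn and comparing the results would show whether one of them resolves a name differently from the others, or not at all.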

Re: [OMPI users] Can't start program across network

2009-03-16 Thread Prentice Bisbal
Raymond Wan wrote: > > Hi Jeff, > > Some "good" news (but still some bad news). Y and Z are part of a set > of 8 machines and I found out that mpirun works for one of them. I > didn't check a couple of them before -- sorry! However, I'm no closer > to the solution since all 8 should be "iden

Re: [OMPI users] Can't start program across network

2009-03-16 Thread Prentice Bisbal
Raymond Wan wrote: > > Actually, when I run the above mpirun command, I don't see "sleep" > running locally on machine Y, either. However, if I did this: > > mpirun --host Y --np 3 sleep 1000 > > I see 3 instances of "sleep" when I do ps -aedf. Does mpirun try to > "ssh" all networked machines

Re: [OMPI users] Can't start program across network

2009-03-15 Thread Raymond Wan
Hi Jeff, Some "good" news (but still some bad news). Y and Z are part of a set of 8 machines and I found out that mpirun works for one of them. I didn't check a couple of them before -- sorry! However, I'm no closer to the solution since all 8 should be "identical", according to our sysa

Re: [OMPI users] Can't start program across network

2009-03-14 Thread Jeff Squyres
Can you send all the information here: http://www.open-mpi.org/community/help/ (including the network information) Thanks! On Mar 13, 2009, at 9:12 PM, Raymond Wan wrote: Hi Jeff, Jeff Squyres wrote: > On Mar 13, 2009, at 6:17 AM, Raymond Wan wrote: > >> What doesn't work is: >> >> [

Re: [OMPI users] Can't start program across network

2009-03-13 Thread Raymond Wan
Hi Jeff, Jeff Squyres wrote: On Mar 13, 2009, at 6:17 AM, Raymond Wan wrote: What doesn't work is: [On Y] mpirun --host Y,Z --np 2 uname -a [On Y] mpirun --host X,Y,Z --np 3 uname -a ...and similarly for machine Z. I can confirm that from any of the 3 Do you see "rsh" or "ssh" in the o

Re: [OMPI users] Can't start program across network

2009-03-13 Thread Jeff Squyres
On Mar 13, 2009, at 6:17 AM, Raymond Wan wrote: What doesn't work is: [On Y] mpirun --host Y,Z --np 2 uname -a [On Y] mpirun --host X,Y,Z --np 3 uname -a ...and similarly for machine Z. I can confirm that from any of the 3 machines, I can ssh to the other without typing in a password. I
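The passwordless-ssh claim can also be checked non-interactively, so that a hidden password or host-key prompt shows up as a failure instead of a hang. This is a rough sketch under assumptions: the helper names `ssh_check_command` and `can_ssh_without_password` and the `runner` parameter are hypothetical, the 5-second and 15-second timeouts are arbitrary, and it relies on the OpenSSH client options `BatchMode` and `ConnectTimeout`.

```python
import subprocess


def ssh_check_command(host):
    """Build an ssh command that fails instead of prompting for input."""
    # BatchMode=yes disables password/passphrase prompts in OpenSSH;
    # "true" is a no-op remote command that exits with status 0.
    return ["ssh", "-o", "BatchMode=yes", "-o", "ConnectTimeout=5", host, "true"]


def can_ssh_without_password(host, runner=subprocess.run):
    """Return True if `ssh host true` succeeds with no interaction.

    `runner` is a hypothetical hook so the check can be exercised
    without actually opening a connection.
    """
    try:
        result = runner(
            ssh_check_command(host),
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=15,
        )
    except (OSError, subprocess.TimeoutExpired):
        # ssh is not installed, or the connection attempt hung.
        return False
    return result.returncode == 0
```

Calling `can_ssh_without_password("Z")` from Y, and the reverse from Z, covers both directions for the failing pair of hosts.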

[OMPI users] Can't start program across network

2009-03-13 Thread Raymond Wan
Hi all, I'm having a problem running mpirun and I was wondering if there are suggestions on how to find out the cause. I have 3 machines that I can use: X, Y, and Z. The important thing is that X is different from Y and Z (the software installed, version of Linux, etc. Y and Z are identic