Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-17 Thread Clement Chu
Thanks Jeff. The problem is solved on the latest version (8172). Clement Jeff Squyres wrote: Clement -- Sorry for the delay in replying. We're running around crazy here at SC, which pretty much keeps us away from e-mail except early in the morning and late at night. We fixed a bunch of

Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-16 Thread Mostyn Lewis
I get this on rh9 ONLY if I leave out a -hostfile option on mpirun, otherwise it works fine. This is an old Red Hat. Regards, Mostyn On Wed, 16 Nov 2005, Jeff Squyres wrote: Clement -- Sorry for the delay in replying. We're running around crazy here at SC, which pretty much keeps us away fr

Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-16 Thread Jeff Squyres
Clement -- Sorry for the delay in replying. We're running around crazy here at SC, which pretty much keeps us away from e-mail except early in the morning and late at night. We fixed a bunch of things in the sm btl as of r8136 (someone reported similar issues as you, and we took the exchang

Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-14 Thread Clement Chu
Hi Jeff, I tried the rc6 and trunk nightly 8150. I got the same problem. I copied the message from terminal as below. [clement@localhost testmpi]$ ompi_info Open MPI: 1.1a1r8113 Open MPI SVN revision: r8113 Open RTE: 1.1a1r8113 Open RTE SVN revision: r811

Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-10 Thread Clement Chu
Thanks for your help. kfc is machine name and clement is the username of this machine. Do you think it is the problem? Then I tried to remove kfc machine and run again. This time I can run mpi program and there is no error message output, but it is no program output too. I think it is som

Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-10 Thread Jeff Squyres
One minor thing that I notice in your ompi_info output -- your build and run machines are different (kfc vs. clement). Are these both FC4 machines, or are they different OS's/distros? On Nov 10, 2005, at 10:01 AM, Clement Chu wrote: [clement@kfc TestMPI]$ mpirun -d -np 2 test [kfc:29199] pr

Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-10 Thread Jeff Squyres
The name of the launcher is "rsh", but it actually defaults to trying to fork/exec ssh. Unfortunately, your backtrace doesn't tell much because there are no debugging symbols. Can you recompile OMPI with debugging enabled and send a new backtrace? Use: ./configure CFLAGS=-g

Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-10 Thread Clement Chu
there is the backtrace result: (now i am using 8085) Does mpirun start rsh?? I think I need ssh instead of rsh. [clement@kfc tmp]$ gdb mpirun core.17766 GNU gdb Red Hat Linux (6.3.0.0-1.21rh) Copyright 2004 Free Software Foundation, Inc. GDB is free software, covered by the GNU General Public L

Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-10 Thread Clement Chu
[clement@kfc TestMPI]$ mpirun -d -np 2 test [kfc:29199] procdir: (null) [kfc:29199] jobdir: (null) [kfc:29199] unidir: /tmp/openmpi-sessions-clement@kfc_0/default-universe [kfc:29199] top: openmpi-sessions-clement@kfc_0 [kfc:29199] tmp: /tmp [kfc:29199] [0,0,0] setting up session dir with [kfc:291

Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-10 Thread Jeff Squyres
I'm sorry -- I wasn't entirely clear: 1. Are you using a 1.0 nightly tarball or a 1.1 nightly tarball? We have made a bunch of fixes to the 1.1 tree (i.e., the Subversion trunk), but have not fully vetted them yet, so they have not yet been taken to the 1.0 release branch yet. If you have no

Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-10 Thread Clement Chu
I have tried the latest version (rc5 8053), but the error is still here. Jeff Squyres wrote: We've actually made quite a few bug fixes since RC4 (RC5 is not available yet). Would you mind trying with a nightly snapshot tarball? (there were some SVN commits last night after the nightly snaps

Re: [O-MPI users] Error on mpirun in Redhat Fedora Core 4

2005-11-10 Thread Jeff Squyres
We've actually made quite a few bug fixes since RC4 (RC5 is not available yet). Would you mind trying with a nightly snapshot tarball? (there were some SVN commits last night after the nightly snapshot was made; I've just initiated another snapshot build -- r8085 should be on the web site