Re: [OMPI users] open-mpi ssh hostname problem

2009-02-06 Thread Jeff Squyres
I'm not quite sure what you did here; did you set the IP address and hostname to something that is resolvable via gethostbyname()? E.g., does the hostname exist in DNS or in /etc/hosts and match the IP address that you set? On Feb 6, 2009, at 6:18 AM, Bernhard Knapp wrote: Dear users I

Re: [OMPI users] Global Communicator

2009-02-06 Thread Jeff Squyres
On Feb 6, 2009, at 11:59 AM, McCalla, Mac wrote: Does the "default mpirun command" implementation match the build environment for quest_ompi.x ? ie., what mpi implementation (mpich, LAM/MPI, OPENMPI, or other) was quest_ompi.x compiled/linked with? and does that match the result of "which

Re: [OMPI users] OpenMPI hangs across multiple nodes.

2009-02-06 Thread Jeff Squyres
Open MPI requires that there be no TCP firewall between hosts that are used in a single parallel job -- it uses random TCP ports between peers. On Feb 5, 2009, at 2:39 AM, Robertson Burgess wrote: I have checked with IT. It is TCP. I have been told that there's a firewall on the nodes. Shou

Re: [OMPI users] signal 15 (terminated)

2009-02-06 Thread Jeff Squyres
Sorry for the delay in replying; this turned into a hectic week... On Feb 4, 2009, at 11:28 AM, Hana Milani wrote: Jeff, Thanks for helping me. Is this a Fortran program, perchance? Yes, it has been written by f77, but I have compiled it with gfortran. People have also done the same with no

Re: [OMPI users] Global Communicator

2009-02-06 Thread McCalla, Mac
Hi Ted, Does the "default mpirun command" implementation match the build environment for quest_ompi.x ? ie., what mpi implementation (mpich, LAM/MPI, OPENMPI, or other) was quest_ompi.x compiled/linked with? and does that match the result of "which mpirun"? You might try running a job using you

Re: [OMPI users] Global Communicator

2009-02-06 Thread Ted Yu
Just to make sure, because I have to use open mpi for this program: I'm using the default mpirun command. When I type "man mpirun", these are the first few lines: MPIRUN(1) OPEN MPI COMMANDS MPIRUN(1) NAME    orterun,  mpirun,  mpiexec  - Execute seri

Re: [OMPI users] Global Communicator

2009-02-06 Thread Ralph Castain
Hi Ted From what I can tell, you are not using Open MPI, but mpich's mpirun. You might want to ask for help on their mailing list. Ralph On Feb 6, 2009, at 8:49 AM, Ted Yu wrote: Thanx for the reply. I guess I should go back a step: I had used the openmpi version on my system which is

Re: [OMPI users] Global Communicator

2009-02-06 Thread Ted Yu
Thanx for the reply.  I guess I should go back a step:  I had used the openmpi version on my system which is simply: "mpirun -machinefile $PBS_NODEFILE -np $NPROCS ${CODE} >/ul/tedhyu/fuelcell/HOH/test/HH.out" This did not work because I was just getting a blank output. I tried this older ver

[OMPI users] open-mpi ssh hostname problem

2009-02-06 Thread Bernhard Knapp
Dear users I am using the parallel software Gromacs on Fedora8 nodes. I installed the software and run it without problems but thereafter I moved the node to our server-room and did the following: - set ip adress, subnetmask and gateway - changed the ssh port in /etc/ssh/sshd_config since we u

Re: [OMPI users] Global Communicator

2009-02-06 Thread doriankrause
Ted Yu wrote: I'm trying to run a job based on openmpi. For some reason, the program and the global communicator are not in sync and it reads that there is only one processors, whereas, there should be 2 or more. Any advice on where to look? Here is my PBS script. Thanx! PBS SCRIPT: #!/bi