Re: [OMPI users] [Open MPI Announce] Open MPI v1.4.5 released

2012-02-14 Thread alexalex43210
Hello Benton, Thanks for your notice. But I am a novice for the parallel computation, I often use Fortran to compile my program, now I want to use the Parallel, can you give me some help how to begin? PS: I learned about OPEN MPI is the choice for my question solution. am I right? best

Re: [OMPI users] Strange OpenMPI messages

2012-02-14 Thread Gustavo Correa
Hi Tohiko OpenFabrics network a.k.a. Infiniband a.k.a. IB. To check if the compute nodes have IB interfaces, try: lspci [and search the output for Infinband] To see if the IB interface is configured try: ifconfig -a [and search the output for ib0, ib1, or similar] To check if the OFED module

[OMPI users] Different Prefix for different nodes

2012-02-14 Thread Tohiko Looka
Hello, I'm trying to run my application on different nodes; each with a different path to OpenMPI libraries and binaries. According to the documentation I can set '-prefix' on a per-context basis, so I can set '-prefix' differently for each node, but I wasn't able to do it and I didn't find an exa

Re: [OMPI users] Strange OpenMPI messages

2012-02-14 Thread Tohiko Looka
Sorry for the noob question, but how do I check my network type and if OFED service is running correctly or not? And how do I run it Thank you, On Tue, Feb 14, 2012 at 2:14 PM, Jeff Squyres wrote: > Do you have an OpenFabrics-based network? (e.g., InfiniBand or iWarp) > > If so, this error mes

[OMPI users] EuroMPI 2012 CALL FOR PAPERS

2012-02-14 Thread George Bosilca
[Apologies if you got multiple copies of this email.] EuroMPI 2012 CALL FOR PAPERS Recent Advances in Message Passing Interface. 19th European MPI Users' Group Meeting Austrian Academy of Sciences, Vienna, Austria, September 23-26, 2012 www.eurompi2012.org BACKGROUND AND TOPICS EuroMPI is th

[OMPI users] MPI_Waitall strange behaviour on remote nodes

2012-02-14 Thread Richard Bardwell
In trying to debug an MPI_Waitall hang on a remote node, I created a simple code to test. If we run the simple code below on 2 nodes on a local machine, we send the number 1 and receive number 1 back. If we run the same code on a local node and a remote node, we send number 1 but get 32767 back.

Re: [OMPI users] help: sm btl does not work when I specify the same host twice or more in the node list

2012-02-14 Thread yanyg
Yes, in short, I start a c-shell script from bash command line, in which I mpirun another c-shell script which start the computing process. The only OMPI related envars are PATH and LD_LIBRARY_PATH. Any other OPMI envars I should set?

Re: [OMPI users] help: sm btl does not work when I specify the same host twice or more in the node list

2012-02-14 Thread Ralph Castain
Guess I am confused. It was my impression that you mpirun a script that actually starts your process. Yes? Sent from my iPad On Feb 14, 2012, at 7:32 AM, ya...@adina.com wrote: > Hi Ralph, > > Could you please tell me what OMPI envars are broken? or what > OMPI envars should be there for OMPI

Re: [OMPI users] help: sm btl does not work when I specify the same host twice or more in the node list

2012-02-14 Thread yanyg
Hi Ralph, Could you please tell me what OMPI envars are broken? or what OMPI envars should be there for OMPI to work properly? Although I start my c-shell script from a bash command line(not sure if this matters), I only add Open MPI executable and lib path to $PATH and $LD_LIBRARY_PATH, no ot

Re: [OMPI users] help: sm btl does not work when I specify the same host twice or more in the node list

2012-02-14 Thread Ralph Castain
It looks like your script is stripping away the OMPI envars. That will break the job. Can you look at the script and see why it does that? Sent from my iPad On Feb 14, 2012, at 6:44 AM, ya...@adina.com wrote: > Hi Jeff, > > The command "env | grep OMPI" output nothing but a blank line > from

Re: [OMPI users] help: sm btl does not work when I specify the same host twice or more in the node list

2012-02-14 Thread yanyg
Hi Jeff, The command "env | grep OMPI" output nothing but a blank line from my script. Anything I should set for mpirun? On the other hand, you may get reminded that I found you discussed some similar issue with Jonathan Dursi. The difference is that when I tried with --mca btl_sm_num_fifos #(

Re: [OMPI users] MPI orte_init fails on remote nodes

2012-02-14 Thread Richard Bardwell
In trying to debug the MPI_Waitall hang on the remote node, I created a simple code to test. If we run the simple code below on 2 nodes on a local machine, we send the number 1 and receive number 1 back. If we run the same code on a local node and a remote node, we send number 1 but get 32767 ba

Re: [OMPI users] MPI orte_init fails on remote nodes

2012-02-14 Thread Richard Bardwell
OK, I get it running if I specify /usr/local/bin/mpiexec instead of just mpiexec. Now, the program hangs at the first MPI_Waitall on the remote node. The program runs just fine if both nodes are on the same machine. Any ideas how to debug this ? Many Thanks Richard - Original Message -

Re: [OMPI users] Strange OpenMPI messages

2012-02-14 Thread Jeff Squyres
Do you have an OpenFabrics-based network? (e.g., InfiniBand or iWarp) If so, this error message usually means that OFED is either installed incorrectly, or is not running properly (e.g., its services didn't get started properly upon boot). If you don't have an OpenFabrics-based network, then i

Re: [OMPI users] MPI orte_init fails on remote nodes

2012-02-14 Thread Jeff Squyres
Make sure that your LD_LIBRARY_PATH is being set in your shell startup files for *non-interactive logins*. For example, ensure that LD_LIBRARY_PATH is set properly, even in this case: - ssh some-other-node env | grep LD_LIBRARY_PATH - (note that this is different than "ssh some-other-no

Re: [OMPI users] help: sm btl does not work when I specify the same host twice or more in the node list

2012-02-14 Thread Jeff Squyres
On Feb 13, 2012, at 2:26 PM, ya...@adina.com wrote: > I tried to run the same test of ring_c from standard examples in > Open MPI 1.4.3 distribution. If I ran as you described from the > command line, it worked without any problem with sm btl > included(with --mca btl self,sm,openib). However,

Re: [OMPI users] MPI orte_init fails on remote nodes

2012-02-14 Thread Richard Bardwell
Jeff, I wiped out all versions of openmpi on all the nodes including the distro installed version. I reinstalled version 1.4.4 on all nodes. I now get the error that libopen-rte.so.0 cannot be found when running mpiexec across different nodes, even though the LD_LIBRARY_PATH for all nodes point

[OMPI users] Strange OpenMPI messages

2012-02-14 Thread Tohiko Looka
Greetings, Until today I was running my openmpi applications with no errors/warnings Today I restarted my computer (possibly after an automatic openmpi update) and got these warnings when running my program [tohiko@kw12614 1d]$ mpirun -x LD_LIBRARY_PATH -hostfile hosts -np 10 hello librdmacm: coul