Re: [OMPI users] Proper way to stop MPI process

2011-09-30 Thread Ralph Castain
SIGTERM should work - what version are you using? Ralph Sent from my iPad On Sep 28, 2011, at 1:40 PM, Xin Tong wrote: > I am wondering what the proper way is to stop an mpirun process and the child > processes it created. I tried sending SIGTERM, but it does not respond to it. > What kind of signal
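
As an illustration of the SIGTERM approach discussed in this thread, here is a minimal sketch of a graceful stop: send SIGTERM, wait a short grace period, and only then fall back to SIGKILL. It is POSIX-only and uses a sleeping Python child as a stand-in for mpirun; the helper names `stop_process` and `start_stand_in` and the 5-second grace period are assumptions for illustration, not anything from Open MPI itself.

```python
import signal
import subprocess
import sys


def start_stand_in():
    """Start a long-running child; a hypothetical stand-in for an mpirun process."""
    return subprocess.Popen(
        [sys.executable, "-c", "import time; time.sleep(60)"]
    )


def stop_process(proc, grace_seconds=5.0):
    """Send SIGTERM, wait up to grace_seconds, then fall back to SIGKILL.

    Returns the child's return code (negative signal number on POSIX
    when the child was terminated by a signal).
    """
    proc.send_signal(signal.SIGTERM)
    try:
        return proc.wait(timeout=grace_seconds)
    except subprocess.TimeoutExpired:
        # The child did not exit in time; force termination.
        proc.kill()
        return proc.wait()


if __name__ == "__main__":
    child = start_stand_in()
    print("return code:", stop_process(child))
```

With a real launcher, the same pattern would be applied to the launcher's PID; whether the launcher forwards the signal to its child processes depends on the launcher and its version.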

[OMPI users] problem running with RoCE over 10GbE

2011-09-30 Thread Konz, Jeffrey (SSA Solution Centers)
Encountered a problem when trying to run OpenMPI 1.5.4 with RoCE over 10GbE fabric. Got this run time error: An invalid CPC name was specified via the btl_openib_cpc_include MCA parameter. Local host: atl3-14 btl_openib_cpc_include value: rdmacm Invalid name:

Re: [OMPI users] Open MPI process cannot do send-receive message correctly on a distributed memory cluster

2011-09-30 Thread Jack Bryan
Thanks, I am using non-blocking MPI_Isend to send out messages and blocking MPI_Recv to receive them. Each MPI_Isend uses a distinct buffer to hold the message, which is not changed until the message is received. Then, the sender process waits for the MPI_Isend to be finished. Be

Re: [OMPI users] Open MPI process cannot do send-receive message correctly on a distributed memory cluster

2011-09-30 Thread Rayson Ho
You can use a debugger (just gdb will do, no TotalView needed) to find out which MPI send & receive calls are hanging the code on the distributed cluster, and see if the send & receive pair is due to a problem described at: Deadlock avoidance in your MPI programs: http://www.cs.ucsb.edu/~hnielsen/

[OMPI users] Open MPI process cannot do send-receive message correctly on a distributed memory cluster

2011-09-30 Thread Jack Bryan
Hi, I have an Open MPI program, which works well on a Linux shared memory multicore (2 x 6 cores) machine. But it does not work well on a distributed cluster with Linux Open MPI. I found that the process sends out some messages to other processes, which cannot receive them. What is th

Re: [OMPI users] EXTERNAL: Re: Unresolved reference 'mbind' and 'get_mempolicy'

2011-09-30 Thread Ed Blosch
Thank you for all this information. Your diagnosis is totally right. I actually sent e-mail yesterday but apparently it never got through :< It IS the MPI application that is failing to link, not OpenMPI itself; my e-mail was not well written; sorry Brice. The situation is this: I am trying to

Re: [OMPI users] EXTERNAL: Re: Unresolved reference 'mbind' and 'get_mempolicy'

2011-09-30 Thread Jeff Squyres
I think the issue here is that it's linking the *MPI application* that is causing the problem. Is that right? If so, can you send your exact application compile line, and the output of that compile line with "--showme" at the end? On Sep 29, 2011, at 4:24 PM, Brice Goglin wrote: > Le 28/

Re: [OMPI users] EXTERNAL: Re: Unresolved reference 'mbind' and 'get_mempolicy'

2011-09-30 Thread Jeff Squyres
On Sep 29, 2011, at 12:45 PM, Blosch, Edwin L wrote: > If I add --without-hwloc in addition to --without-libnuma, then it builds. > Is that a reasonable thing to do? Is there a better workaround? This > 'hwloc' module looks like it might be important. As a note of explanation: hwloc is effec

Re: [OMPI users] EXTERNAL: Re: Unresolved reference 'mbind' and 'get_mempolicy'

2011-09-30 Thread Jeff Squyres
On Sep 28, 2011, at 5:02 PM, Blosch, Edwin L wrote: > ./configure --prefix=/release/cfd/openmpi-intel --without-tm --without-sge > --without-lsf --without-psm --without-portals --without-elan --without-slurm > --without-loadleveler --without-libnuma --enable-mpirun-prefix-by-default > --enable-

Re: [OMPI users] Role of ethernet interfaces of startup of openmpi job using IB

2011-09-30 Thread Jeff Squyres
On Sep 30, 2011, at 6:29 AM, Salvatore Podda wrote: > For the above reason, does anyone on the list know the order/ranking > in which the > ethernet interfaces will be queried when there are multiple ones? > And what are the rules? They're all used equally. -- Jeff Squyres jsquy...@ci

Re: [OMPI users] Role of ethernet interfaces of startup of openmpi job using IB

2011-09-30 Thread Salvatore Podda
Thanks for the prompt reply! On Sep 27, 2011, at 6:35 AM, Salvatore Podda wrote: We would like to know if the ethernet interfaces play any role in the startup phase of an Open MPI job using InfiniBand. In that case, where can we find some literature on this topic? Unfortunately, there's n

Re: [OMPI users] VampirTrace integration with VT_GNU_NMFILE environment variable

2011-09-30 Thread Matthias Jurenz
Hello, first, please consider that the VT versions integrated in Open MPI v1.5.x and v1.4.x are different - respectively the names of the environment variables for setting a pre-created symbol list: Open MPI v1.4.x: VT_NMFILE Open MPI v1.5.x: VT_GNU_NMFILE Furthermore, make su

Re: [OMPI users] MPIRUN + Environtment Variable

2011-09-30 Thread Eugene Loh
On 09/29/11 20:54, Xin Tong wrote: I need to set up some environment variables before I run my application (appA). I am currently using mpirun -np 1 -host socrates (socrates is another machine) appA. Before appA runs, it expects some environment variables to be set up. How do I do that? %
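
One option for the question in this thread is Open MPI's `-x` option to mpirun, which exports an environment variable to the launched processes (one variable per `-x`). The sketch below only builds such a command line; the helper name `build_mpirun_command` and the variable `APP_HOME` with its value are hypothetical, chosen purely for illustration, while `appA` and `socrates` come from the original question.

```python
import shlex


def build_mpirun_command(app, host, env, np=1):
    """Build an mpirun argument list that exports each variable with -x.

    `env` maps variable names to values; each entry becomes its own
    "-x NAME=value" pair, since -x takes one variable per occurrence.
    """
    cmd = ["mpirun", "-np", str(np), "-host", host]
    for name, value in env.items():
        cmd += ["-x", f"{name}={value}"]
    cmd.append(app)
    return cmd


if __name__ == "__main__":
    # APP_HOME is a made-up variable name used only for this example.
    command = build_mpirun_command("appA", "socrates", {"APP_HOME": "/opt/appA"})
    print(shlex.join(command))
```

Building the command as a list rather than a single string avoids shell-quoting problems if a value contains spaces; the list can be passed directly to `subprocess.run`.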