Re: [OMPI users] MPI_Barrier hangs on second attempt but only when multiple hosts used.

2014-05-05 Thread Daniels, Marcus G
From: Clay Kirkland [mailto:clay.kirkl...@versityinc.com] Sent: Friday, May 02, 2014 03:24 PM To: us...@open-mpi.org Subject: [OMPI users] MPI_Barrier hangs on second attempt but only when multiple hosts used. I have been using MPI for many many years so I have very well debugged mpi tests

Re: [OMPI users] MPI_Barrier hangs on second attempt but only when multiple hosts used.

2014-05-03 Thread Ralph Castain
Hmmm...just testing on my little cluster here on two nodes, it works just fine with 1.8.2: [rhc@bend001 v1.8]$ mpirun -n 2 --map-by node ./a.out In rank 0 and host= bend001 Do Barrier call 1. In rank 0 and host= bend001 Do Barrier call 2. In rank 0 and host= bend001 Do Barrier call 3. In r

[OMPI users] MPI_Barrier hangs on second attempt but only when multiple hosts used.

2014-05-02 Thread Clay Kirkland
I have been using MPI for many many years so I have very well debugged mpi tests. I am having trouble on either openmpi-1.4.5 or openmpi-1.6.5 versions though with getting the MPI_Barrier calls to work. It works fine when I run all processes on one machine but when I run with two or more host

Re: [OMPI users] mpi_barrier

2013-09-29 Thread Huangwei
Dear George, Please see below. On 29 September 2013 01:03, George Bosilca wrote: > > On Sep 29, 2013, at 01:19, Huangwei wrote: > > Dear All, > > In my code I implement mpi_send/mpi_receive for a three-dimensional real > array, and the process is as follows: > > all other processors send the ar

Re: [OMPI users] mpi_barrier

2013-09-28 Thread George Bosilca
On Sep 29, 2013, at 01:19, Huangwei wrote: > Dear All, > > In my code I implement mpi_send/mpi_receive for a three-dimensional real > array, and the process is as follows: > > all other processors send the array to rank 0 and then rank 0 receives the > array and puts these arrays into a comple

[OMPI users] mpi_barrier

2013-09-28 Thread Huangwei
Dear All, In my code I implement mpi_send/mpi_receive for a three-dimensional real array, and the process is as follows: all other processors send the array to rank 0 and then rank 0 receives the array and puts these arrays into a complete array. Then mpi_bcast is called to send the complete array fr

Re: [OMPI users] MPI_Barrier in Self-checkpointing call

2012-02-15 Thread Josh Hursey
When you receive that callback, MPI has been put in a quiescent state. As such, it does not allow MPI communication until the checkpoint is completely finished, so you cannot call a barrier in the checkpoint callback. Since Open MPI is doing a coordinated checkpoint, you can assume that all process

[OMPI users] MPI_Barrier in Self-checkpointing call

2012-02-15 Thread Faisal Shahzad
Dear Group, I wanted to do a synchronization check with 'MPI_Barrier(MPI_COMM_WORLD)' in the 'opal_crs_self_user_checkpoint(char **restart_cmd)' call. Although every process is present in this call, it fails to synchronize. Is there any reason why we can't use a barrier? Thanks in advance. Kind regards

Re: [OMPI users] MPI_Barrier, again

2012-02-15 Thread Evgeniy Shapiro
P.P.S. I ran the same test with OpenMPI 1.5.4, the behaviour is the same. Evgeniy Message: 10 List-Post: users@lists.open-mpi.org Date: Sat, 28 Jan 2012 08:24:39 -0500 From: Jeff Squyres Subject: Re: [OMPI users] MPI_Barrier, again To: Open MPI Users Message-ID: <1859c141-813d-46ba-9

Re: [OMPI users] MPI_Barrier, again

2012-02-06 Thread Evgeniy Shapiro
P.S. I have tested with OpenMPI 1.4.5rc4 and the problem is still there. Evgeniy

Re: [OMPI users] MPI_Barrier, again

2012-01-30 Thread Evgeniy Shapiro
./mpibarriertest.f90 OpenMPI: 1.4.3 hangs with 15 processes randomly as described. Evgeniy Message: 10 List-Post: users@lists.open-mpi.org Date: Sat, 28 Jan 2012 08:24:39 -0500 From: Jeff Squyres Subject: Re: [OMPI users] MPI_Barrier, again To: Open MPI Users Message-ID: <1859c141-813d-46ba-97bc-4b0290

Re: [OMPI users] MPI_Barrier, again

2012-01-28 Thread Jeff Squyres
Is there any chance you can make a small-ish reproducer of the issue that we can run? On Jan 27, 2012, at 10:45 AM, Evgeniy Shapiro wrote: > Hi > > I have a strange problem with MPI_Barrier occurring when writing to a > file. The output subroutine (the code is in FORTRAN) is called from > the m

[OMPI users] MPI_Barrier, again

2012-01-27 Thread Evgeniy Shapiro
Hi I have a strange problem with MPI_Barrier occurring when writing to a file. The output subroutine (the code is in FORTRAN) is called from the main program and there is an MPI_Barrier just before the call. In the subroutine 1. Process 0 checks whether the first file exists and, if not, - creat

Re: [OMPI users] MPI_Barrier() consuming CPU cycles

2009-12-04 Thread Nicolas Bock
Hi Jeff, thanks for the explanation. Yes, some of the earlier discussions were in fact very useful. In general I have found this list to be very helpful; my thanks to everyone here who is helping people like me out. The suggestion to use messages and non-blocking receives with MPI_Test() proved just w

Re: [OMPI users] MPI_Barrier() consuming CPU cycles

2009-12-04 Thread Jeff Squyres
On Dec 4, 2009, at 6:54 PM, Nicolas Bock wrote: > in our code we use a very short front-end program to drive a larger set of > codes that do our calculations. Right in the beginning of the front-end, we > have an if() statement such that only the rank 0 front-end does something, > and the other

[OMPI users] MPI_Barrier() consuming CPU cycles

2009-12-04 Thread Nicolas Bock
Hello list, in our code we use a very short front-end program to drive a larger set of codes that do our calculations. Right in the beginning of the front-end, we have an if() statement such that only the rank 0 front-end does something, and the other ranks go right away to an MPI_Barrier() statem

[OMPI users] MPI_Barrier called late within ompi_mpi_finalize when MPIIO fd not closed

2009-07-20 Thread Jed Brown
This helped me track down a leaked file descriptor, but I think the order of events is not desirable. If an MPIIO file descriptor is not closed before MPI_Finalize, I get the following. *** An error occurred in MPI_Barrier *** after MPI was finalized *** MPI_ERRORS_ARE_FATAL (your MPI job will n

Re: [OMPI users] MPI_Barrier Error?

2006-12-16 Thread Jeff Squyres
I have no idea how this e-mail slipped by me -- I'm sorry for the delay in replying. Can you run your code through a memory-checking debugger such as valgrind and see if it turns up any ancillary memory problems? On Dec 1, 2006, at 1:52 PM, Cupp, Matthew R wrote: Hi, In my program I a

[OMPI users] MPI_Barrier Error?

2006-12-01 Thread Cupp, Matthew R
Hi, In my program I am calling MPI_Barrier(MPI_COMM_WORLD), but it seems to cause an error on one node. The node changes depending on how many total nodes I have (it could be 4 or 2). I'm thinking it's MPI_Barrier because I put print statements around it and that's where the program terminate