Re: [OMPI users] Trapping fortran I/O errorsleavingzombiempiprocesses

2010-02-07 Thread Laurence Marks
The attached is a more general test. Please note that these are very sensitive to compilation options with ifort, and probably other compilers as well. In many cases openmpi fails test 1 and perhaps some of the others. Openmpi is not alone; the version of mvapich on my cluster fails a simple "CNTRL

Re: [OMPI users] Trapping fortran I/O errorsleavingzombiempiprocesses

2010-02-06 Thread Laurence Marks
The following code reproduces the problem for mpif90/ifort 11.1/openmpi-1.4.1. With an empty test.input (touch test.input) some not reproducible number of zombies processes are created. include "mpif.h" call MPI_INIT(ierr) call MPI_COMM_RANK(MPI_COMM_WORLD, irank, ierr)

Re: [OMPI users] Trapping fortran I/O errorsleavingzombiempiprocesses

2010-02-06 Thread Laurence Marks
I managed to find time to reproduce the issue, although it is not very reproducible in it's results and I suspect it may not be easy to reproduce with a simple code plus I've never actually constructed a mpi code so (I am cc'ing Michael Sternberg who compiled the openmpi in case there are flags

Re: [OMPI users] Trapping fortran I/O errorsleavingzombiempiprocesses

2010-01-29 Thread Jeff Squyres
On Jan 29, 2010, at 9:13 AM, Laurence Marks wrote: > OK, but trivial codes don't always reproduce problems. Yes, but if the problem is a file reading beyond the end, that should be fairly isolated behavior. > Is strace useful? Sure. But let's check to see if the apps are actually dying or han