[OMPI users] OpenMPI checkpoint/restart on multiple nodes

2010-02-07 Thread Andreea Costea
Hi, Let's say I have an MPI application running on several hosts. Is there any way to checkpoint this application without having a shared storage between the nodes? I already took a look at the examples here http://www.osl.iu.edu/research/ft/ompi-cr/examples.php, but it seems that in both cases th

[OMPI users] Difficulty with MPI_Unpack

2010-02-07 Thread Prentice Bisbal
Hello, everyone. I'm having trouble packing/unpacking this structure: typedef struct{ int index; int* coords; }point; The size of the coords array is not known a priori, so it needs to be a dynamic array. I'm trying to send it from one node to another using MPI_Pack/MPI_Unpack as shown below.

Re: [OMPI users] Trapping fortran I/O errorsleavingzombiempiprocesses

2010-02-07 Thread Laurence Marks
The attached is a more general test. Please note that these are very sensitive to compilation options with ifort, and probably other compilers as well. In many cases openmpi fails test 1 and perhaps some of the others. Openmpi is not alone; the version of mvapich on my cluster fails a simple "CNTRL