[OMPI users] MX/BTL eager_limit/min_send_size

2007-08-21 Thread Kees Verstoep
Hi, I am currently working on a parallel app that shows some issues using MX/BTL (not MTL) with the current trunk version of OpenMPI. Basically, for its communication the app needs to do a lot of random <= 8KB MPI_Isend()s which are polled away using MPI_Iprobe() and MPI_Recv(). The async send

[OMPI users] mpi broadcast

2007-08-21 Thread Sefa Arslan
Hi, Is there a special way to broadcast a 2D array, or is the standard MPI_Bcast enough? Whenever I try to send/broadcast 2D arrays, the operation is terminated and I get a message like: "rank 2 in job 71 X_49783 caused collective abort of all ranks exit status of rank 2: k

Re: [OMPI users] mpi broadcast

2007-08-21 Thread Jelena Pjesivac-Grbovic
Hi Sefa, I presume you are talking about C or C++. If so, it depends on how you allocated the 2D array. If the data is not contiguous (contiguous meaning a 1D array that you access in a 2D manner), you have to define a special datatype. Hope this helps, Jelena Sefa Arslan wrote: Hi, Is there a special way to broadc
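
To illustrate the allocation point in the reply above, here is a minimal sketch in C of a 2D array stored as one contiguous block with a row-pointer table on top. The helper names alloc_2d, free_2d and is_contiguous are hypothetical and do not come from the thread. The sketch makes no MPI calls; the comment only indicates how such a buffer could be passed to a single MPI_Bcast, assuming doubles and a root rank chosen by the caller.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical helper: allocate rows x cols doubles as ONE contiguous block,
 * plus a table of row pointers so that a[i][j] indexing still works.
 * Returns NULL if either allocation fails. */
double **alloc_2d(int rows, int cols)
{
    assert(rows > 0 && cols > 0);

    double *data = malloc((size_t)rows * (size_t)cols * sizeof *data);
    double **a = malloc((size_t)rows * sizeof *a);
    if (data == NULL || a == NULL) {
        free(data);
        free(a);
        return NULL;
    }
    for (int i = 0; i < rows; i++)
        a[i] = data + (size_t)i * (size_t)cols;   /* row i starts i*cols elements in */
    return a;
}

/* Hypothetical helper: release an array obtained from alloc_2d. */
void free_2d(double **a)
{
    if (a != NULL) {
        free(a[0]);   /* a[0] is the start of the contiguous data block */
        free(a);
    }
}

/* Hypothetical helper: returns 1 if every row starts exactly cols elements
 * after the previous one, i.e. the data forms a single block. */
int is_contiguous(double **a, int rows, int cols)
{
    for (int i = 1; i < rows; i++)
        if (a[i] != a[i - 1] + cols)
            return 0;
    return 1;
}

/* With this layout the whole array could be sent in one call, for example:
 *     MPI_Bcast(&a[0][0], rows * cols, MPI_DOUBLE, root, MPI_COMM_WORLD);
 * An array built from a separate malloc per row is generally NOT contiguous,
 * so it would need one broadcast per row or a derived datatype instead. */
```

If each row is allocated separately, is_contiguous would typically return 0, which is the case where a single MPI_Bcast over rows * cols elements reads past the end of the first row.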

[OMPI users] How to build and use checkpoint/restart fault tolerance in Open MPI.

2007-08-21 Thread Hiep Bui Hoang
Hello, I'm Hiep. I'm trying to use the checkpoint/restart feature in Open MPI. I have read the information about this feature at https://svn.open-mpi.org/trac/ompi/wiki/ProcessFT_CR and in Open-MPI-FT-CR-Draft-v1.pdf. I built Open MPI from the "trunk", which I got via Subversion. But I don't know how to enable

Re: [OMPI users] How to build and use checkpoint/restart fault tolerance in Open MPI.

2007-08-21 Thread Josh Hursey
Hello, There are a few things you need to do to build Open MPI with Checkpoint/Restart support. By default Open MPI is configured without checkpoint/restart support. 1) Make sure you have BLCR successfully installed and loaded on your system(s) 2) configure Open MPI with the "--with-ft=cr"

Re: [OMPI users] How to build and use checkpoint/restart fault tolerance in Open MPI.

2007-08-21 Thread Josh Hursey
I just updated the Users Guide on the wiki. I added the following sections: - Compiling and Building - Example Checkpoint/Restart Use Cases Let me know if you have any trouble with this work. Cheers, Josh On Aug 21, 2007, at 1:46 PM, Josh Hursey wrote: Hello, There are a few things you n

[OMPI users] openmpi realloc() holding onto memory when glibc doesn't

2007-08-21 Thread Josh Aune
The realloc included with openmpi 1.2.3 is not releasing memory to the OS and is causing apps to go into swap. Attached is a little test program that shows calls to realloc not releasing the memory when compiled using mpicc, but when compiled directly with gcc (or icc) calling realloc() frees any

[OMPI users] building static and shared OpenMPI libraries on MacOSX

2007-08-21 Thread Lev Givon
According to the OpenMPI FAQ, specifying the config option --enable-static without specifying --disable-shared should build both shared and static versions of the libraries. When I tried these options on MacOSX 10.4.10 with OpenMPI 1.2.3, however, the following lines in the config output seem to im