FWIW: I'm in the process of porting some code from a branch that allows apps to
perform on-demand checkpoint/recovery-style operations at the app level.
Specifically, it provides the ability to:
* request a "recovery image" - an application-level blob containing state info
required for the app to re

There are some great comments in this thread. Process migration (like
many topics in systems) can get complex fast.
The Open MPI process migration implementation is checkpoint/restart
based (currently using BLCR), and uses an 'eager' style of migration.
This style of migration stops a process comp

On 8/23/2011 1:24 PM, Dick Kachuma wrote:
I have used gprof to profile a program that uses openmpi. The result
shows that the code spends a long time in poll (37% on 8 cores, 50% on
16 and 85% on 32). I was wondering if there is anything I can do to
reduce the time spent in poll.
In serial perfor

You could try updating your OFED version. I think 1.5.3 is the latest one.
Pavel (Pasha) Shamis
---
Application Performance Tools Group
Computer Science and Math Division
Oak Ridge National Laboratory

On Aug 25, 2011, at 7:46 PM, wrote:
>
> Hi all,
>
> it is more hardware or system confi