Hi David:
On Fri, Sep 03, 2010 at 10:50:02AM +1000, David Singleton wrote:
>
> I'm sure this has been discussed before but having watched hundreds of
> thousands of cpuhrs being wasted by difficult-to-detect hung jobs, I'd
> be keen to know why there isn't some sort of "spin-wait backoff" option.
I'm sure this has been discussed before but having watched hundreds of
thousands of cpuhrs being wasted by difficult-to-detect hung jobs, I'd
be keen to know why there isn't some sort of "spin-wait backoff" option.
For example, a way to specify spin-wait for x seconds/cycles/iterations
then backo
On 1 Sep 2010, at 23:32, Jaison Mulerikkal wrote:
> Hi,
>
> I am getting interested in this thread.
>
> I'm looking for some solutions, where I can redirect a task/message
> (MPI_send) to a particular process (say rank 1), which is in a queue (at rank
> 1) to another process (say rank 2), if t
On 2 Sep 2010, at 15:56, Brock Palen wrote:
> Ashly still having trouble using padb with openmpi/1.4.2
>
> [dianawon@nyx0862 ~]$ /home/software/rhel5/padb/3.0/padb -a -Q
> [nyx0862.engin.umich.edu:30717] [[16608,0],0]-[[25542,0],0] oob-tcp:
> Communication retries exceeded. Can not communicate
Ashly still having trouble using padb with openmpi/1.4.2
[dianawon@nyx0862 ~]$ /home/software/rhel5/padb/3.0/padb -a -Q
[nyx0862.engin.umich.edu:30717] [[16608,0],0]-[[25542,0],0] oob-tcp:
Communication retries exceeded. Can not communicate with peer
[nyx0862.engin.umich.edu:30717] [[16608,0],0]
Ah ok, I put it there just because the user couldn't read that from my home
space, and never even thought of that. gahhh.
Thanks,
BTW I tried joining the padb mailing list.
Brock Palen
www.umich.edu/~brockp
Center for Advanced Computing
bro...@umich.edu
(734)936-1985
On Sep 1, 2010, at 6:11
On Aug 31, 2010, at 5:39 PM, Patrik Jonsson wrote:
> It seems a bit presumptuous of mpi.h to just include mpicxx.h just
> because __cplusplus is defined, since that makes it impossible to link
> C MPI code from C++.
The MPI standard requires that work in both C and C++ applications. It
also re