On May 27, 2008, at 9:33 AM, Gabriele Fatigati wrote:
Great, it works!
Thank you very very much.
But, can you explain me how this parameter works?
You might want to have a look at this short video for a little
background on some relevant OpenFabrics concepts:
http://www.open-mpi.org/video/?category=openfabrics#openfabrics-concepts
In v1.2, for short messages, OMPI will sometimes copy your message to
a pre-posted receive buffer, and immediately mark the MPI request as
"complete". Depending on the timing and current network resource
usage, the message may or may not have been given to the network stack
yet (e.g., if we're out of flow control credits to send to this
particular peer). If your application keeps dipping down into the MPI
layer frequently, this situation will almost certainly resolve itself
once the receiver becomes active or other events occur to free up
available resources. As such, the early completion optimization
pretty much depends on frequent calls to MPI. Without them, since
OMPI currently has no independent progression (e.g., a progress
thread), your message will wait until OMPI's internal progress engine
is tripped again.
Hope that helps.
On Thu, 15 May 2008 21:40:45 -0400, Jeff Squyres said:
Sorry this message escaped for so long it got buried in my INBOX.
The
problem you're seeing might be related to one we just answered
about a
similar situation:
http://www.open-mpi.org/community/lists/users/2008/05/5657.php
See if using the pml_ob1_use_early_completion flag works for you.
On Apr 30, 2008, at 7:05 AM, Gabriele FATIGATI wrote:
Hi,
i tried to run SkaMPI benchmark on IBM-BladeCenterLS21-BCX system
with 256 processors, but test has stopped on "AlltoAll-length"
routine, with count=8192 for some reasons.
I have launched test with:
--mca btl_openib_eager_limit 1024
Same tests with 128 processor or less, have finished successful.
Different values of eager limit dont' solve the problem. Thanks in
advance.
--------------------------
Gabriele Fatigati
CINECA Systems & Tecnologies Department
Supercomputing Group
Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
www.cineca.it Tel: 39 051 6171722
g.fatig...@cineca.it
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
--
Jeff Squyres
Cisco Systems
_______________________________________________
users mailing
--------------------------
Gabriele Fatigati
CINECA Systems & Tecnologies Department
Supercomputing Group
Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
www.cineca.it Tel: 39 051 6171722
g.fatig...@cineca.it
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
--
Jeff Squyres
Cisco Systems