That is wonderful, that fixes the observed problem for running with OB1. Has a bug for this been filed to get RDMA working on macs? The only working MPI lib is MPICH-GM as this problem happens with LAM-7.1.3 also.

So on track for one bug.

Would the person working on the DR PML like me to try anymore tests?

Brock Palen
Center for Advanced Computing
bro...@umich.edu
(734)936-1985


On Dec 7, 2006, at 9:50 AM, Scott Atchley wrote:

On Dec 6, 2006, at 3:09 PM, Scott Atchley wrote:

Brock and Galen,

We are willing to assist. Our best guess is that OMPI is using the
code in a way different than MPICH-GM does. One of our other
developers who is more comfortable with the GM API is looking into it.

We tried running with HPCC with:

$ mpirun -np 4 -machinefile hosts -mca btl ^tcp  -mca
btl_gm_min_rdma_size $((10*1024*1024)) ./hpcc.ompi.gm

and HPL passes. The problem seems to be in the RDMA fragmenting code
on OSX. The boundary values at the edges of the fragments are not
correct.

Scott
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users



Reply via email to