Bruce,

This issue was previously fixed on master and v2.x, but for some reason the fix was not backported to v1.10.

I made a PR at https://github.com/open-mpi/ompi-release/pull/1120/files

In the meantime, feel free to manually apply the patch at https://patch-diff.githubusercontent.com/raw/open-mpi/ompi-release/pull/1120.patch


Cheers,


Gilles


On 4/30/2016 7:40 AM, Palmer, Bruce J wrote:

I’ve been trying to recreate the semantics of the Global Arrays gather and scatter operations using MPI RMA routines, and I’ve run into some issues with MPI datatypes. I’ve been focusing on building MPI versions of the GA gather and scatter calls, which I’ve been implementing using MPI datatypes built with the MPI_Type_create_struct call. I’ve developed a test program that simulates copying data into and out of a 1D distributed array of size NSIZE. Each processor contains a segment of approximately size NSIZE/nproc and is responsible for assigning every nproc-th value in the array, starting with the value indexed by its own rank. After assigning values and synchronizing the distributed data structure, each processor then reads the values set by the processor of next higher rank (the process with rank nproc-1 reads the values set by process 0).
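
In rough outline, the setup looks like the sketch below. This is a minimal illustration only, not the attached test program; the segment bounds, variable names, and use of int elements are assumptions.

#include <mpi.h>
#include <stdlib.h>

#define NSIZE 10000    /* array length used in the failing tests */

int main(int argc, char **argv)
{
    int rank, nproc;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nproc);

    /* Local segment of the 1D distributed array; each segment is only
       approximately NSIZE/nproc long, the later ranks absorb the remainder. */
    int lo   = (int)(((long long)NSIZE *  rank)      / nproc);
    int hi   = (int)(((long long)NSIZE * (rank + 1)) / nproc);
    int nloc = hi - lo;

    int *local = calloc(nloc, sizeof(int));
    MPI_Win win;
    MPI_Win_create(local, (MPI_Aint)nloc * sizeof(int), sizeof(int),
                   MPI_INFO_NULL, MPI_COMM_WORLD, &win);

    /* This rank writes global indices rank, rank + nproc, rank + 2*nproc, ...
       and later reads back the indices written by (rank + 1) % nproc,
       using one of the three RMA protocols described below.              */

    MPI_Win_free(&win);
    free(local);
    MPI_Finalize();
    return 0;
}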

The distributed array is represented by an MPI window created with a standard MPI_Win_create call. The values in the array are set and read using MPI RMA operations, either MPI_Get/MPI_Put or MPI_Rget/MPI_Rput. Three different protocols have been used. The first is to call MPI_Win_lock to create a shared lock on the remote processor, then call MPI_Put/MPI_Get, and then call MPI_Win_unlock to clear the lock. The second protocol uses the MPI request-based calls: after the call to MPI_Win_create, MPI_Win_lock_all is called to start a passive synchronization epoch on the window, and data is written to and read from the distributed array using MPI_Rput/MPI_Rget, each immediately followed by a call to MPI_Wait on the handle returned by the MPI_Rput/MPI_Rget call. The third protocol also creates a passive synchronization epoch immediately after window creation, but uses calls to MPI_Put/MPI_Get, each immediately followed by a call to MPI_Win_flush_local. These three protocols seem to cover all the possibilities I have seen in other MPI RMA-based implementations of ARMCI/GA.
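
As a rough illustration, the put side of the three protocols looks roughly like the following. This is a hedged sketch, not the attached program; the helper names and the single MPI_INT transfers are assumptions made only to keep the example short.

#include <mpi.h>

/* 1) per-operation shared lock / unlock */
static void put_lock_unlock(int val, int target, MPI_Aint disp, MPI_Win win)
{
    MPI_Win_lock(MPI_LOCK_SHARED, target, 0, win);
    MPI_Put(&val, 1, MPI_INT, target, disp, 1, MPI_INT, win);
    MPI_Win_unlock(target, win);
}

/* 2) request-based: assumes MPI_Win_lock_all(0, win) was called once
      right after MPI_Win_create (and MPI_Win_unlock_all before free). */
static void put_request_based(int val, int target, MPI_Aint disp, MPI_Win win)
{
    MPI_Request req;
    MPI_Rput(&val, 1, MPI_INT, target, disp, 1, MPI_INT, win, &req);
    MPI_Wait(&req, MPI_STATUS_IGNORE);
}

/* 3) also assumes a lock_all epoch: plain Put followed by a local flush */
static void put_flush_local(int val, int target, MPI_Aint disp, MPI_Win win)
{
    MPI_Put(&val, 1, MPI_INT, target, disp, 1, MPI_INT, win);
    MPI_Win_flush_local(target, win);
}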

The issue I’ve run into is that these tests seem to work reliably if I build the datatype using the MPI_Type_create_subarray function, but fail for larger arrays (NSIZE ~ 10000) when I use MPI_Type_create_struct. Because the values being set by each processor are evenly spaced, I can use either function in this case (this is not generally true in applications). With the struct datatype, the test hangs on 2 processors using lock/unlock, crashes for the request-based protocol, and does not get the correct values in the Get phase of the data transfer when using flush_local. These tests were done on a Linux cluster with an InfiniBand interconnect and the value of NSIZE is 10000. For comparison, the same test using MPI_Type_create_subarray seems to function reliably for all three protocols with NSIZE=1000000 using 1, 2, and 8 processors on 1 and 2 SMP nodes.
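
For reference, a minimal sketch of how such a gather/scatter datatype can be built with MPI_Type_create_struct is shown below: one MPI_INT block at each of n arbitrary element displacements. The helper name, the byte-displacement convention, and the use of MPI_INT are illustrative assumptions, not the attached code; for the evenly spaced indices used in this test an equivalent type can also be built with MPI_Type_create_subarray by viewing the segment as a 2D array and selecting a single column.

#include <mpi.h>
#include <stdlib.h>

/* Build a datatype selecting n MPI_INT elements at the element indices
   given in disp_elems[] (converted to byte displacements). */
static MPI_Datatype make_struct_type(int n, const int *disp_elems)
{
    MPI_Datatype  newtype;
    int          *blocklens = malloc(n * sizeof(int));
    MPI_Aint     *displs    = malloc(n * sizeof(MPI_Aint));
    MPI_Datatype *types     = malloc(n * sizeof(MPI_Datatype));

    for (int i = 0; i < n; i++) {
        blocklens[i] = 1;
        displs[i]    = (MPI_Aint)disp_elems[i] * sizeof(int); /* byte offsets */
        types[i]     = MPI_INT;
    }
    MPI_Type_create_struct(n, blocklens, displs, types, &newtype);
    MPI_Type_commit(&newtype);

    free(blocklens); free(displs); free(types);
    return newtype;
}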

I’ve attached the test program for these test cases. Does anyone have a suggestion about what might be going on here?

Bruce



