Dear All,

I would appreciate some general advice on how to efficiently implement the following scenario.

I am looking into how to send a large amount of data over IB _once_, to multiple receivers. The trick is, of course, that while the ping-pong benchmark delivers great bandwidth, it does so by re-using the already registered memory buffers. Since I need to send the data once, the memory registration penalty is not easily avoided. I've been looking into the following approaches:

1. have multiple ranks send different parts of the data to different receivers, in the hope that the memory registration cost will be hidden (a rough sketch of what I mean follows this list)
2. pre-register two smaller buffers, into which the data is copied before sending
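For reference, a stripped-down sketch of what I mean by approach 1 -- the rank layout, the 64 MiB slice size and the buffer handling are placeholders, not my actual benchmark:

/* Approach 1 sketch: the first half of the ranks act as senders, the
 * second half as receivers, and each sender pushes its own slice of the
 * payload to a dedicated receiver. The hope is that the per-connection
 * registration cost is amortized across the senders. */
#include <mpi.h>
#include <stdlib.h>

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);

    int rank, nprocs;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

    const int nsenders = nprocs / 2;           /* first half sends        */
    const int slice    = 64 * 1024 * 1024;     /* 64 MiB slice per sender */
    char *buf = malloc(slice);

    if (rank < nsenders) {
        MPI_Send(buf, slice, MPI_BYTE, nsenders + rank, 0, MPI_COMM_WORLD);
    } else if (rank - nsenders < nsenders) {
        MPI_Recv(buf, slice, MPI_BYTE, rank - nsenders, 0,
                 MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    }

    free(buf);
    MPI_Finalize();
    return 0;
}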

The first approach is the best I've managed so far, but the bandwidth reached is still lower than what I observe with the ping-pong benchmark. Also, the performance depends on the number of sending ranks and drops when there are too many of them.

In the second approach one pays for a data copy. My thinking was that since the effective memory bandwidth available on a single modern CPU is larger than the IB bandwidth, I could squeeze out some performance by combining double buffering and multithreading, e.g.,

Step 1. thread A sends the data in the current buffer, while, behind the scenes, thread B copies the next chunk of data into the other buffer
Step 2. the buffers are swapped
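In code, the pipeline I have in mind looks roughly like this. To keep the sketch simple I've replaced thread B by a nonblocking MPI_Isend, so the copy into the other pre-registered buffer overlaps with the transfer of the current one; the buffer sizes and names are made up:

/* Double-buffering sketch: the receiver is assumed to post one MPI_Recv of
 * up to CHUNK bytes per iteration. */
#include <mpi.h>
#include <string.h>

enum { CHUNK = 8 * 1024 * 1024 };   /* size of each pre-registered staging buffer */

static void send_pipelined(const char *src, size_t total, int dest, MPI_Comm comm)
{
    char *buf[2];
    /* MPI_Alloc_mem gives the library a chance to register this memory with
     * the HCA once, so the registration cost is paid up front. */
    MPI_Alloc_mem(CHUNK, MPI_INFO_NULL, &buf[0]);
    MPI_Alloc_mem(CHUNK, MPI_INFO_NULL, &buf[1]);

    int cur = 0;
    size_t off = 0;
    size_t len = (total < CHUNK) ? total : CHUNK;
    memcpy(buf[cur], src, len);

    while (off < total) {
        MPI_Request req;
        /* "Thread A": the current buffer goes out on the wire.            */
        MPI_Isend(buf[cur], (int)len, MPI_BYTE, dest, 0, comm, &req);

        /* "Thread B": stage the next chunk into the other buffer.         */
        size_t next_off = off + len;
        size_t next_len = 0;
        if (next_off < total) {
            next_len = (total - next_off < CHUNK) ? total - next_off : CHUNK;
            memcpy(buf[1 - cur], src + next_off, next_len);
        }

        MPI_Wait(&req, MPI_STATUS_IGNORE);
        off = next_off;
        len = next_len;
        cur = 1 - cur;          /* Step 2: swap the buffers                */
    }

    MPI_Free_mem(buf[0]);
    MPI_Free_mem(buf[1]);
}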

A similar idea would be to use MPI_Get on the remote rank: the sender would copy the next chunk of data into the second buffer while the RMA window exposing the first buffer is open. In theory, I would expect those two operations to execute simultaneously, with the memory copy hopefully hidden behind the IB transfer.
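A very rough sketch of that variant, with rank 0 exposing two CHUNK-sized halves in one window and rank 1 pulling with MPI_Get -- the names, the sizes and the fence-based synchronization are my assumptions, not a worked-out implementation:

/* MPI_Get sketch: rank 0 keeps staging the next chunk into the half of the
 * window that is not being read, while rank 1 pulls the exposed half. */
#include <mpi.h>
#include <string.h>

enum { CHUNK = 8 * 1024 * 1024 };

void pull_pipelined(const char *src, char *dst, size_t total,
                    int rank, MPI_Comm comm)
{
    char *stage = NULL;
    MPI_Win win;
    /* Only rank 0 exposes memory; rank 1 contributes a zero-sized window. */
    MPI_Win_allocate(rank == 0 ? 2 * CHUNK : 0, 1, MPI_INFO_NULL,
                     comm, &stage, &win);

    size_t nchunks = (total + CHUNK - 1) / CHUNK;
    if (rank == 0 && nchunks > 0)
        memcpy(stage, src, (total < CHUNK) ? total : CHUNK);

    MPI_Win_fence(0, win);
    for (size_t i = 0; i < nchunks; i++) {
        size_t len = (i + 1 == nchunks) ? total - i * CHUNK : CHUNK;

        if (rank == 1) {
            /* Read chunk i from the half that rank 0 filled last epoch.   */
            MPI_Get(dst + i * CHUNK, (int)len, MPI_BYTE,
                    0, (MPI_Aint)((i % 2) * CHUNK), (int)len, MPI_BYTE, win);
        } else if (i + 1 < nchunks) {
            /* Meanwhile, rank 0 stages chunk i+1 into the other half.     */
            size_t nlen = (i + 2 == nchunks) ? total - (i + 1) * CHUNK : CHUNK;
            memcpy(stage + ((i + 1) % 2) * CHUNK, src + (i + 1) * CHUNK, nlen);
        }
        MPI_Win_fence(0, win);  /* close epoch i, open epoch i+1           */
    }
    MPI_Win_free(&win);
}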

Of course, the experiments didn't really work out. The first (multi-rank) approach is OK and shows some improvement, but the bandwidth still falls short of the ping-pong numbers. None of my double-buffering approaches helped at all, possibly because of memory bandwidth contention.

So I was wondering, have any of you had experience with a similar scenario? What would you recommend as the best approach?

Thanks a lot!

Marcin

