Re: [OMPI users] GPU and CPU timing - OpenMPI and Thrust

Eduardo Morras Wed, 9 May 2012 11:13:52 -0400

At 15:59 08/05/2012, you wrote:

Yep you are correct. I did the same and it worked. When I have morethan 3 MPI tasks there is lot of overhead on GPU.
But for CPU there is not overhead. All three machines have 4 quadcore processors with 3.8 GB RAM.
Just wondering why there is no degradation of performance on CPU ?

Your GPU is saturated. It has more work than it can handle so itsperformance drops.

If your kernel code is the one you posted some days ago you candivide the number of threads and multiply the work done in each one,so you do the same work (maybe faster) without using/wasting all thethread pool and sm bandwith.

HTH

Re: [OMPI users] GPU and CPU timing - OpenMPI and Thrust

Reply via email to