Let me add two more things.

Note that we *always* compare performance and acceleration to our
super-tuned state-of-the-art CPU code, which I can confidently say that is
among the fastest if not the fastest to date, and never to some slow (CPU)
implementation. Therefore, while other codes might be able to get 10-20x
with GPUs and many x-es even with pre-Fermi (CC <2.0) cards, for obvious
reasons we simply can't.

Still you'll get high *absolute performance* regardless whether you can use
CPU only or CPU+GPU, so this should be a decent deal, right?

If anybody volunteers to add software-based floating point atomic
operations for CC <2.0 and try running the current kernels on earlier
hardware, I'm willing to give pointers, but I simply had no time to try it
myself. This would enable using earlier cards, with a considerable
performance penalty of emulated atomic ops (plus these cards are anyway
slower), but it *might* be worth a try!

Feel free to drop me a mail if anybody is interested.

Cheers,

--
Szilárd


On Thu, Jan 24, 2013 at 3:48 PM, Szilárd Páll <szilard.p...@cbr.su.se>wrote:

> On Thu, Jan 24, 2013 at 3:28 PM, Justin Lemkul <jalem...@vt.edu> wrote:
>
>>
>>
>> On 1/24/13 9:23 AM, James Starlight wrote:
>>
>>> oh that was simply solved by upgrading of G++ :)
>>>
>>> the only problem which remains is the missing of support of mu GPU :(
>>> That time I've tried to start simulation on simply 2 cores CPU + geforce
>>> 9500
>>>
>>>  From md run I've obtained
>>>>
>>>
>>> NOTE: Error occurred during GPU detection:
>>>        CUDA driver version is insufficient for CUDA runtime version
>>>        Can not use GPU acceleration, will fall back to CPU kernels.
>>>
>>> ED sampling will be performed!
>>> ED: Reading edi file new_a2a_pca_biased.edi
>>> ED: Note: Reference and average structure are composed of the same atom
>>> indices.
>>> ED: Found 1 ED group.
>>> Using 1 MPI thread
>>> Using 2 OpenMP threads
>>>
>>> No GPUs detected
>>>
>>>
>>> I've installed cuda-5.0 with driver
>>> NVRM version: NVIDIA UNIX x86_64 Kernel Module  304.54  Sat Sep 29
>>> 00:05:49 PDT 2012
>>>
>>> that system works fine with the recent nvidia GPU's but has no support
>>> for GeForce 9500.
>>>
>>>
>> GeForce 9500 cards have a compute capability of 1.1.  The minimum
>> required for Gromacs is 2.0 (noted in the installation instructions).
>
>
> Exactly! Anything below CC 2.0 lacks certain features that make those
> devices rather  slow for our algorithms and therefore pretty much
> impractical.
>
> Cheers,
> --
> Szilárd
>
>
>
>
>
>>
>>
>> -Justin
>>
>> --
>> ==============================**==========
>>
>> Justin A. Lemkul, Ph.D.
>> Research Scientist
>> Department of Biochemistry
>> Virginia Tech
>> Blacksburg, VA
>> jalemkul[at]vt.edu | (540) 231-9080
>> http://www.bevanlab.biochem.**vt.edu/Pages/Personal/justin<http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin>
>>
>> ==============================**==========
>>
>> --
>> gmx-users mailing list    gmx-users@gromacs.org
>> http://lists.gromacs.org/**mailman/listinfo/gmx-users<http://lists.gromacs.org/mailman/listinfo/gmx-users>
>> * Please search the archive at http://www.gromacs.org/**
>> Support/Mailing_Lists/Search<http://www.gromacs.org/Support/Mailing_Lists/Search>before
>>  posting!
>> * Please don't post (un)subscribe requests to the list. Use the www
>> interface or send it to gmx-users-requ...@gromacs.org.
>> * Can't post? Read 
>> http://www.gromacs.org/**Support/Mailing_Lists<http://www.gromacs.org/Support/Mailing_Lists>
>>
>
>
--
gmx-users mailing list    gmx-users@gromacs.org
http://lists.gromacs.org/mailman/listinfo/gmx-users
* Please search the archive at 
http://www.gromacs.org/Support/Mailing_Lists/Search before posting!
* Please don't post (un)subscribe requests to the list. Use the
www interface or send it to gmx-users-requ...@gromacs.org.
* Can't post? Read http://www.gromacs.org/Support/Mailing_Lists

Reply via email to