Steve Cousins wrote:

I just thought I'd mention some troubles and fixes I had with trying to get Gromacs 4.0.5 to configure with --with-fft=mkl on a SGI Altix 3700 BX2 system. I did:

export CC=icc
export F77=ifort
export CFLAGS="-O3 -ip -ftz"
export FFLAGS="-O3 -ip -ftz"
export LDFLAGS="-O3 -ip -ftz -L/opt/intel/Compiler/11.1/046/mkl/lib/64 -L/usr/lib" export CPPFLAGS="-I/opt/intel/Compiler/11.1/046/mkl/include/fftw -I/usr/include"

and then:

./configure --prefix=/usr/local/gromacs-4-mkl-noopts --without-x --enable-fortran --with-fft=mkl

However, this gave messages saying that it couldn't find the mkl libraries. This is because MKL doesn't include a library called libmkl.so anymore. To get this to work I had to edit the configure script and change:

    -lmkl

to

    -lmkl_intel_lp64 -lmkl_intel_thread -lmkl_core -lguide -lpthread

... or put LIBS="-lmkl_intel_lp64 -lmkl_intel_thread -lmkl_core -lguide -lpthread" on the configure command line.

In the end, I found that using MKL is about 20% slower than using fftw3 when running:

    time gmxtest.pl all

This may or may not be useful as they are just serial results with the small test programs:

With fftw3:

    real    1m17.889s
    user    1m4.660s
    sys     0m8.672s

With MKL:

    real    1m34.731s
    user    3m33.024s
    sys     0m15.280s

Maybe the overhead of starting up the threads for such small jobs is what is causing the slow-down in MKL.

Plausible, but even if FFT threads were not available, only a fraction of these tests were using FFT and some of those were probably I/O dominated. So the relative speed is still unknown. These tests are designed to assess correct implementation and compilation, and not as benchmarks or real-world examples. There was a benchmark set years ago for the last major GROMACS release, which will probably still serve your purpose http://oldwww.gromacs.org/content/view/24/37/. Lys/PME is the relevant test. Bear in mind that grompp no longer accepts -np, -sort or -shuffle arguments, and these should be deleted as necessary. Your speed should probably beat any of those reported there.

Anybody have any real-world comparisons of using MKL vs. FFTW3?

No, but in the meantime build both, and if your users are actually using PME (the main algorithm that uses FFT), they can do a simple speed tests once they have a full-sized system set up.

Mark
_______________________________________________
gmx-users mailing list    gmx-users@gromacs.org
http://lists.gromacs.org/mailman/listinfo/gmx-users
Please search the archive at http://www.gromacs.org/search before posting!
Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org.
Can't post? Read http://www.gromacs.org/mailing_lists/users.php

Reply via email to