------- Comment #8 from tkoenig at gcc dot gnu dot org 2010-05-14 09:15 -------

New timings, on x86_64-unknown-linux-gnu. I split the "invalidate" subroutine off into its own file (invalidate.f90) to make sure the optimizers don't optimize the timed code away:

i...@linux-fd1f:/tmp> gfortran -O3 matmul.f90 invalidate.f90
i...@linux-fd1f:/tmp> time ./a.out
 matmul = 11.100311 s
 subroutine with explicit interface: 2.0216932 s
 unrolled subroutine with explicit interface: 1.9317064 s
 inline with sum 1.9087105 s

real    0m16.971s
user    0m16.959s
sys     0m0.005s

--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=37131