Sebastian,

Here are (attached) results for testing HIRLAM with and without -ftree-loop-linear.

As you can see, the results are neutral: 4 loops fewer vectorized, but about 50 fewer recognized.

Now I like to redo that test with -ftree-loop-distribution. Can you send me a patch against the trunk (otherwise it won't be a fair comparison).

Kind regards,

--
Toon Moene - e-mail: [EMAIL PROTECTED] - phone: +31 346 214290
Saturnushof 14, 3738 XG  Maartensdijk, The Netherlands
At home: http://moene.indiv.nluug.nl/~toon/
GNU Fortran's path to Fortran 2003: http://gcc.gnu.org/wiki/Fortran2003
Baseline, no source changes:

Mon Dec 10 17:45:19 UTC 2007 (revision 130746)

Compilation flags:

CCFLAGS := -g -O3 $(MACHINECPP) -ffast-math -fno-associative-math -march=native 
-mtune=native -ftree-vectorizer-verbose=2
FCFLAGS := -g -O3 -fbacktrace -ffpe-trap=invalid,zero,overflow -ffast-math 
-fno-associative-math -march=native -mtune=native -ftree-vectorizer-verbose=2

Loops vectorized:
5675
Loops not vectorized:
13705

Timings:
20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK    12.7488 SECONDS
20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK  2445.9609 SECONDS
20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK   259.3362 SECONDS
20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK    12.4408 SECONDS
20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK   305.9351 SECONDS
20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK   262.1124 SECONDS
20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK    12.7448 SECONDS
20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK  2323.3733 SECONDS
20061201_12r/HL_Cycle_2006120112r.html: FORECAST TOOK   412.7058 SECONDS
20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK   264.5685 SECONDS
20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK    12.6648 SECONDS
20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK   306.7352 SECONDS
20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK   261.5164 SECONDS
20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK    12.7688 SECONDS
20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK  2325.3774 SECONDS
20061202_00r/HL_Cycle_2006120200r.html: FORECAST TOOK   413.8739 SECONDS

Baseline, no source changes, with -ftree-loop-linear:

Mon Dec 10 17:45:19 UTC 2007 (revision 130746)

Compilation flags:

CCFLAGS := -g -O3 $(MACHINECPP) -ftree-loop-linear -ffast-math 
-fno-associative-math -march=native -mtune=native -ftree-vectorizer-verbose=2
FCFLAGS := -g -O3 -ftree-loop-linear -fbacktrace 
-ffpe-trap=invalid,zero,overflow -ffast-math -fno-associative-math 
-march=native -mtune=native -ftree-vectorizer-verbose=2

This compilation got one ICE:

rttov_aitosu.f90: In function 'rttov_aitosu':
rttov_aitosu.f90:4: error: definition in block 262 does not dominate use in 
block 134
for SSA_NAME: pretmp.240_59 in statement:
prephitmp.220_58 = PHI <pretmp.240_59(134), D.1480_1373(138)>
PHI argument
pretmp.240_59
for PHI node
prephitmp.220_58 = PHI <pretmp.240_59(134), D.1480_1373(138)>
rttov_aitosu.f90:4: internal compiler error: verify_ssa failed
Please submit a full bug report,
with preprocessed source if appropriate.
See <http://gcc.gnu.org/bugs.html> for instructions.

Worked around by compiling this file without -ftree-loop-linear

Loops vectorized:
5671
Loops not vectorized:
13655

Timings:
20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK    12.5648 SECONDS
20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK  2444.1208 SECONDS
20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK   259.3402 SECONDS
20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK    12.4728 SECONDS
20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK   307.8672 SECONDS
20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK   260.0323 SECONDS
20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK    12.8608 SECONDS
20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK  2310.2485 SECONDS
20061201_12r/HL_Cycle_2006120112r.html: FORECAST TOOK   411.3977 SECONDS
20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK   261.1283 SECONDS
20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK    12.7248 SECONDS
20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK   308.1313 SECONDS
20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK   262.7564 SECONDS
20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK    12.6528 SECONDS
20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK  2336.5620 SECONDS
20061202_00r/HL_Cycle_2006120200r.html: FORECAST TOOK   410.6577 SECONDS

Reply via email to