>On Mon, 28 Mar 2005, James E Wilson wrote: >> Steven Bosscher wrote: >>> OK, so I know this is not a popular subject, but can we *please* stop >>> working on loop.c and focus on getting the new RTL and tree loop passes >>> to do what we want? >> I don't think anyone is objecting to this. [...] >> I would however make a distinction here between new development work and >> maintenance. It would be better if new development work happened in the new >> loop optimizer. However, we still need to do maintenance work in loop.c. > >...and since Canqun reported 2.5% improvement on SPEC CFP2000 on ia64 with >his current patch, I really think we should consider it. >
Besides this, I¡¯ve got another patch for improving general induction variable optimizations defined in loop.c. With these two patches and properly setting the loop unrolling parameters, the tests of both NAS and SPEC CPU2000 benchmarks on IA-64 1GHz system show a good result. 1. The following table shows the test result of NAS benchmarks: Gcc-20050404 Gcc-20050404 Ratio + Optimized. Bt.W 22.16s 22.68s 0.98 Cg.A 9.23s 7.45s 1.24 Ep.W 12.3s 11.97s 1.03 Ft.A 38.41s 25.98s 1.48 Is.B 34.94s 33.47s 1.04 Lu.W 32.93s 31.59s 1.04 Mg.A 21.91s 14.64s 1.50 Sp.W 59.71s 55.67s 1.07 Geomean 1.16 "Gcc-20050404" is the GCC mainline version dated on April 4, 2005. It includes my previous patch of RECORD_TYPE for COMMON blocks without equivalence objects. The compile options for ¡°Gcc-20050404¡± is ¡°-O3 -funroll-loops -fprefetch-loop-arrays¡±, and ¡°- O3 -funroll-loops -fprefetch-loop-arrays --param max- unrolled-insns=600 --param max-average-unrolled- insns=320¡± for¡°Gcc-20050404+Optimized¡±. 2. The SPEC CFP2000 test uses the same options as above. ¡°Gcc-20050404¡± got 426 SPEC ratio, and ¡°Gcc- 20050404 + Optimized¡± got 459 SPEC ratio. You can download the attachments to see more details. And if the address giv splitting were not miss in the new loop unroller, the SPEC ratio up to 513 can be expected. >We all know how hard it is to get this kind of improvement on any of the >SPECs -- and in fact improving the current optimizers will make raise >the >bar for the new ones. ;-) > >Question is: who is going review/potentially approve this patch? > >Gerald Canqun Yang Creative Compiler Research Group. National University of Defense Technology, China.
CFP2000.154.pdf
Description: Adobe PDF document
CFP2000.155.pdf
Description: Adobe PDF document