I ran SPEC 2006 on 32-bit PowerPC last night with the reassociation set to 2. I do see a win in bwaves, but unfortunately it is not a large enough win: bwaves is still a regression relative to GCC 4.5. I also see regressions in 3 other benchmarks (I tend to omit differences of less than 2%):

401.bzip2        97.99%
410.bwaves      113.88%
436.cactusADM    93.96%
444.namd         93.74%

The profile differences are as follows. Unfortunately, I'm not sure I can post sample counts under Spec rules:

Bzip2:

GCC 4.7   GCC 4.7 with patches   Function
=======   ====================   ========
 28.96%                 28.39%   mainSort
 15.94%                 15.49%   BZ2_decompress
 12.56%                 12.35%   mainGtU.part.0
 11.59%                 11.54%   generateMTFValues
  8.89%                  9.04%   fallbackSort
  6.60%                  8.28%   BZ2_compressBlock
  7.48%                  7.21%   handle_compress.isra.2
  6.24%                  5.95%   BZ2_bzDecompress
  0.55%                  0.58%   add_pair_to_block
  0.54%                  0.54%   BZ2_hbMakeCodeLengths

Bwaves:

GCC 4.7   GCC 4.7 with patches   Function
=======   ====================   ========
 78.70%                 74.73%   mat_times_vec_
 11.68%                 13.21%   bi_cgstab_block_
  6.72%                  8.47%   shell_
  2.11%                  2.62%   jacobian_
  0.79%                  0.96%   flux_

CactusADM:

GCC 4.7   GCC 4.7 with patches   Function
=======   ====================   ========
 99.67%                 99.69%   bench_staggeredleapfrog2_

Namd:

GCC 4.7   GCC 4.7 with patches   Function
=======   ====================   ========
 15.43%                 14.71%   _ZN20ComputeNonbondedUtil26calc_pair_energy_fullelectEP9nonbonded.part.39
 11.94%                 11.80%   _ZN20ComputeNonbondedUtil19calc_pair_fullelectEP9nonbonded.part.40
 10.18%                 11.52%   _ZN20ComputeNonbondedUtil32calc_pair_energy_merge_fullelectEP9nonbonded.part.37
  9.87%                  9.02%   _ZN20ComputeNonbondedUtil16calc_pair_energyEP9nonbonded.part.41
  9.55%                  8.85%   _ZN20ComputeNonbondedUtil9calc_pairEP9nonbonded.part.42
  9.52%                  9.05%   _ZN20ComputeNonbondedUtil25calc_pair_merge_fullelectEP9nonbonded.part.38
  7.24%                  8.72%   _ZN20ComputeNonbondedUtil26calc_self_energy_fullelectEP9nonbonded.part.31
  6.28%                  6.42%   _ZN20ComputeNonbondedUtil19calc_self_fullelectEP9nonbonded.part.32
  5.23%                  6.18%   _ZN20ComputeNonbondedUtil32calc_self_energy_merge_fullelectEP9nonbonded.part.29
  5.13%                  4.66%   _ZN20ComputeNonbondedUtil16calc_self_energyEP9nonbonded.part.33
  4.72%                  4.43%   _ZN20ComputeNonbondedUtil25calc_self_merge_fullelectEP9nonbonded.part.30
  4.60%                  4.37%   _ZN20ComputeNonbondedUtil9calc_selfEP9nonbonded.part.34

-- 
Michael Meissner, IBM
5 Technology Place Drive, M/S 2757, Westford, MA 01886-3141, USA
meiss...@linux.vnet.ibm.com fax +1 (978) 399-6899