On Wed, Apr 11, 2012 at 12:39 PM, Richard Guenther
<richard.guent...@gmail.com> wrote:
> On Tue, Apr 10, 2012 at 8:43 PM, Igor Zamyatin <izamya...@gmail.com> wrote:
>> Hi All!
>>
>> Here is a patch that enables unroll at O2 for Atom.
>>
>> This gives good performance boost on EEMBC 2.0 (~+8% in Geomean for 32
>> bits) with quite moderate code size increase (~5% for EEMBC2.0, 32
>> bits).
>
> 5% is not moderate.  Your patch does enable unrolling at -O2 but not -O3,
> why? Why do you disable register renaming?  check_imull requires a function
> comment.

Sure, enabling unroll for O3 could be the next step.
We can't avoid code size increase with unroll - what number do you
think will be appropriate?
Register renaming was the reason of several degradations during tuning process
Comment for check_imull was added

>
> This completely looks like a hack for EEMBC2.0, so it's definitely not ok.

Why? EEMBC was measured and result provided here just because this
benchmark considers to be very relevant for Atom

>
> -O2 is not supposed to give best benchmark results.

O2 is wide-used so performance improvement could be important for users.

>
> Thanks,
> Richard.
>
>>
>> Tested for i386 and x86-64, ok for trunk?

Updated patch attached

>>
>> Thanks,
>> Igor
>>
>> ChangeLog:
>>
>> 2012-04-10  Yakovlev Vladimir  <vladimir.b.yakov...@intel.com>
>>
>>        * gcc/config/i386/i386.c (check_imul): New routine.
>>        (ix86_loop_unroll_adjust): New target hook.
>>        (ix86_option_override_internal): Enable unrolling on Atom at -O2.
>>        (TARGET_LOOP_UNROLL_ADJUST): New define.

Attachment: unroll1.patch
Description: Binary data

Reply via email to