> 在 2018年8月3日,06:27,Jeff Law <l...@redhat.com> 写道:
> 
> On 07/26/2018 12:06 AM, 瞿仙淼 wrote:
>> 
>>> 在 2018年7月25日,上午5:24,Jeff Law <l...@redhat.com> 写道:
>>> 
>>> On 07/24/2018 12:18 PM, Sandra Loosemore wrote:
>>>> On 07/24/2018 09:45 AM, Jeff Law wrote:
>>>>> On 07/23/2018 10:21 PM, Sandra Loosemore wrote:
>>>>> I'm not a big fan of more awk code, but I'm not going to object to it :-)
>>>>> 
>>>>> Why does the port have its own little pass for condition code
>>>>> optimization (cse_cc)?  What is it doing that can't be done with our
>>>>> generic optimizers?
>>>> 
>>>> This pass was included in the initial patch set we got from C-SKY, and
>>>> as it didn't seem to break anything I left it in.  Perhaps C-SKY can
>>>> provide a testcase that demonstrates why it's still useful in the
>>>> current version of GCC; otherwise we can remove this from the initial
>>>> port submission and restore it later if some performance analysis shows
>>>> it is still worthwhile.
>>> FWIW it looks like we model CC setting on just a few insns, (add,
>>> subtract) so I'd be surprised if this little mini pass found much.  I'd
>>> definitely like to hear from the csky authors here.
>>> 
>>> Alternately, you could do add some instrumentation to flag when it
>>> triggers, take a test or two that does, reduce it and we can then look
>>> at the key RTL sequences and see what the pass is really doing.
>>> 
>> 
>> I wrote a case to reproduce this problem on C-SKY. C code is as follows:
>> -----------------------------------------------------------------------
>> int e1, e2;
>> 
>> void func (int a, int b, int c, int d, int f, int g)
>> {
>>  e1 = a > b ? f : g;
>>  e2 = a > b ? c : d;
>> 
>>  return;
>> }
>> -----------------------------------------------------------------------
>> 
>> compile to assembler with option “-O3 -S” :
>> -----------------------------------------------------------------------
>> func:
>>  cmplt a1, a0
>>  ld.w  t1, (sp, 0)
>>  ld.w  t0, (sp, 4)
>>  movt  t0, t1
>>  cmplt a1, a0
>>  movt  a3, a2
>>  lrw a1, e2
>>  lrw a2, e1
>>  st.w  a3, (a1, 0)
>>  st.w  t0, (a2, 0)
>>  rts
>> -----------------------------------------------------------------------
>> There is an extra “cmplt a1, a0" in the above code without cse_cc. This 
>> situation mainly occurs when a relatively short branch jump is converted 
>> into a conditional execution instruction. And the CSE pass can not reduce 
>> the same conditional comparison instruction . Here is the rtx sequence after 
>> “cse2” pass.
>> 
>> -----------------------------------------------------------------------
>> (insn 28 13 29 2 (set (reg:CC 33 c)
>>        (gt:CC (reg/v:SI 77 [ a ])
>>            (reg/v:SI 78 [ b ]))) func.c:5 1099 {*cmpgtsi}
>>     (nil))
>> (insn 29 28 30 2 (set (reg/v:SI 82 [ g ])
>>        (if_then_else:SI (eq (reg:CC 33 c)
>>                (const_int 0 [0]))
>>            (reg/v:SI 82 [ g ])
>>            (reg/v:SI 81 [ f ]))) func.c:5 983 {movf}
>>     (expr_list:REG_DEAD (reg/v:SI 81 [ f ])
>>        (expr_list:REG_DEAD (reg:CC 33 c)
>>            (nil))))
>> (insn 30 29 31 2 (set (reg:CC 33 c)
>>        (gt:CC (reg/v:SI 77 [ a ])
>>            (reg/v:SI 78 [ b ]))) func.c:5 1099 {*cmpgtsi}
>>     (expr_list:REG_DEAD (reg/v:SI 78 [ b ])
>>        (expr_list:REG_DEAD (reg/v:SI 77 [ a ])
>>            (nil))))
>> (insn 31 30 18 2 (set (reg/v:SI 80 [ d ])
>>        (if_then_else:SI (eq (reg:CC 33 c)
>>                (const_int 0 [0]))
>>            (reg/v:SI 80 [ d ])
>>            (reg/v:SI 79 [ c ]))) func.c:5 983 {movf}
>>     (expr_list:REG_DEAD (reg/v:SI 79 [ c ])
>>        (expr_list:REG_DEAD (reg:CC 33 c)
>>            (nil))))
>> -----------------------------------------------------------------------
>> 
>> It doesn't seem to check the same conditional comparison instruction .So I 
>> wrote this to solve this problem, but I am not sure if this is the best way 
>> : )
>> 
>> PS, the same conditional comparison instruction cannot be reduced with the 
>> latest version gcc with C-SKY because I just insert the “cse_cc” after 
>> “cse1”, when I insert after “cse2”, this problem can be solved very well.
> I think the cse_cc pass is really just working around one or more bugs
> in CSE and/or a backend bug.  The RTL above clearly shows a common
> subexpression that is not eliminated by CSE.
> 
> What CSE should be trying to do is changing the second and third
> occurrences of (gt:CC (reg 77) (reg 78)) with (reg 33) which would
> create nop-sets which would be subsequently deleted.  I suspect you do
> not have an insn which matches that nop set of the CC register.  If you
> fix that I suspect CSE will work better and eliminate the need for your
> cse_cc pass.

Thanks you for your suggestions, we will try this method.

> 
> jeff

Reply via email to