Hi Ajit,

on 2023/12/8 16:01, Ajit Agarwal wrote:
> Hello Kewen:
> 

[snip...]

> With UNSPEC_MMA_EXTRACT I could generate the register pair but functionally 
> here is the
> below code which is incorrect.
> 
>  l    lxvp %vs0,0(%r4)
>         xxlor %vs32,%vs0,%vs0
>         xvf32ger 0,%vs34,%vs32
>         xvf32gerpp 0,%vs34,%vs33
>         xxmfacc 0
>         stxvp %vs2,0(%r3)
>         stxvp %vs0,32(%r3)
>         blr
> 
> 
> Here is the RTL Code:
> 
> (insn 19 4 20 2 (set (reg:OO 124 [ *ptr_4(D) ])
>         (mem:OO (reg/v/f:DI 122 [ ptr ]) [0 *ptr_4(D)+0 S16 A128])) -1
>      (nil))
> (insn 20 19 9 2 (set (reg:V16QI 129 [orig:124 *ptr_4(D) ] [124])
>         (subreg:V16QI (reg:OO 124 [ *ptr_4(D) ]) 0)) -1
>      (nil))
> (insn 9 20 11 2 (set (reg:XO 119 [ _7 ])
>         (unspec:XO [
>                 (reg/v:V16QI 123 [ src ])
>                 (reg:V16QI 129 [orig:124 *ptr_4(D) ] [124])
>             ] UNSPEC_MMA_XVF32GER)) 2195 {mma_xvf32ger}
>      (expr_list:REG_DEAD (reg:OO 124 [ *ptr_4(D) ])
>         (nil)))
> (insn 11 9 12 2 (set (reg:XO 120 [ _9 ])
>         (unspec:XO [
>                 (reg:XO 119 [ _7 ])
>                 (reg/v:V16QI 123 [ src ])
>                 (reg:V16QI 125 [ MEM[(__vector unsigned char *)ptr_4(D) + 
> 16B] ])
>             ] UNSPEC_MMA_XVF32GERPP)) 2209 {mma_xvf32gerpp}
>      (expr_list:REG_DEAD (reg:V16QI 125 [ MEM[(__vector unsigned char 
> *)ptr_4(D) + 16B] ])
>         (expr_list:REG_DEAD (reg/v:V16QI 123 [ src ])
>             (expr_list:REG_DEAD (reg:XO 119 [ _7 ])
>                 (nil)))))
> (insn 12 11 18 2 (set (mem:XO (reg:DI 126) [1 *dst_10(D)+0 S64 A128])
>         (reg:XO 120 [ _9 ])) 
> "../gcc/testsuite/g++.target/powerpc/vecload.C":13:8 2182 {*movxo}
>      (expr_list:REG_DEAD (reg:DI 126)
>         (expr_list:REG_DEAD (reg:XO 120 [ _9 ])
>             (nil))))
> (note 18 12 0 NOTE_INSN_DELETED)
> 
> r124 and r129 conflicts live range amd ira generates different registers 
> which will not
> serve our purpose.
> 
> Making r124 and r129 as same will not allocate register by ira as r124 could 
> have both OOmode
> and V16QImode.
> 
> Doing this pass before ira_pass has such above issues and we could solve them 
> after making
> after reload pass.

Could you also attach your latest WIP patch?  I'm going to look into the extra 
move issue with it.

Thanks!

BR,
Kewen

Reply via email to