https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118662

--- Comment #17 from rguenther at suse dot de <rguenther at suse dot de> ---
> Am 27.01.2025 um 17:38 schrieb ubizjak at gmail dot com 
> <gcc-bugzi...@gcc.gnu.org>:
> 
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118662
> 
> --- Comment #15 from Uroš Bizjak <ubizjak at gmail dot com> ---
> The testcase now generates (-O2 -ftree-slp-vectorize -fno-vect-cost-model
> -msse4):
> 
> addup:
>        pmovsxbd        (%rdi), %xmm0
>        movd    (%rdi), %xmm1
>        movdqa  %xmm0, %xmm2
>        pextrb  $3, %xmm1, %edx
>        ...
> 
> One possible improvement would be to move QImode value to %xmm1 and 
> sign-extend
> to %xmm0 from %xmm1. Something like:
> 
> addup:
>        movd    (%rdi), %xmm1
>        pmovsxbd        %xmm1, %xmm0
>        movdqa  %xmm0, %xmm2
>        pextrb  $3, %xmm1, %edx
>        ...
> 
> This would save memory read.

IIRC there are a few bugreports that we tend to Duplikate Memory loads into
instructions that can handle memory operands.

> 
> --
> You are receiving this mail because:
> You are the assignee for the bug.
> You are on the CC list for the bug.

Reply via email to