On Fri, Jun 3, 2022 at 12:17 PM Jakub Jelinek <ja...@redhat.com> wrote:
>
> Hi!
>
> My PR105778 patch apparently broke the following testcase.
> If the mask has the top relevant bit clear (i.e. we know we are shifting
> by 0 to wordsize bits - 1) but doesn't have all the bits below it set,
> we emit andsi3 before the shift sequence.  When the pattern had :SI
> for that operand, that was just fine, but now that it can be also HImode
> or for -m64 DImode, we either can use a lowpart or paradoxical subreg to
> SImode as the following patch, or we could use a HImode or DImode AND.

I think it is better to leave the operation in its natural mode and
leave the peephole pass to do its magic, depending on the target.

Uros.

> Ok for trunk if it passes bootstrap/regtest on x86_64-linux and i686-linux?
>
> 2022-06-03  Jakub Jelinek  <ja...@redhat.com>
>
>         PR target/105825
>         * config/i386/i386.md (*ashl<dwi>3_doubleword_mask,
>         *<insn><dwi>3_doubleword_mask): If top bit of mask is clear, but lower
>         bits of mask aren't all set and operands[2] doesn't have SImode,
>         force it to get and use subreg to SImode for andsi3 operand.
>
>         * gcc.dg/pr105825.c: New test.
>
> --- gcc/config/i386/i386.md.jj  2022-06-02 10:40:00.034660893 +0200
> +++ gcc/config/i386/i386.md     2022-06-03 12:00:39.323292767 +0200
> @@ -11935,6 +11935,11 @@ (define_insn_and_split "*ashl<dwi>3_doub
>        != ((<MODE_SIZE> * BITS_PER_UNIT) - 1))
>      {
>        rtx tem = gen_reg_rtx (SImode);
> +      if (GET_MODE (operands[2]) != SImode)
> +       {
> +         operands[2] = force_reg (GET_MODE (operands[2]), operands[2]);
> +         operands[2] = gen_lowpart (SImode, operands[2]);
> +       }
>        emit_insn (gen_andsi3 (tem, operands[2], operands[3]));
>        operands[2] = tem;
>      }
> @@ -12900,6 +12905,11 @@ (define_insn_and_split "*<insn><dwi>3_do
>        != ((<MODE_SIZE> * BITS_PER_UNIT) - 1))
>      {
>        rtx tem = gen_reg_rtx (SImode);
> +      if (GET_MODE (operands[2]) != SImode)
> +       {
> +         operands[2] = force_reg (GET_MODE (operands[2]), operands[2]);
> +         operands[2] = gen_lowpart (SImode, operands[2]);
> +       }
>        emit_insn (gen_andsi3 (tem, operands[2], operands[3]));
>        operands[2] = tem;
>      }
> --- gcc/testsuite/gcc.dg/pr105825.c.jj  2022-06-03 12:01:58.008460659 +0200
> +++ gcc/testsuite/gcc.dg/pr105825.c     2022-06-03 12:01:41.259637783 +0200
> @@ -0,0 +1,13 @@
> +/* PR target/105825 */
> +/* { dg-do compile { target int128 } } */
> +/* { dg-options "-O2" } */
> +/* { dg-additional-options "-mavx" { target avx } } */
> +
> +__int128 j;
> +int i;
> +
> +void
> +foo (void)
> +{
> +  j <<= __builtin_parityll (i);
> +}
>
>         Jakub
>

Reply via email to