On Fri, Oct 4, 2024 at 12:12 PM Jakub Jelinek <ja...@redhat.com> wrote:
>
> Hi!
>
> The following testcase ICEs, because the ix86_expand_int_compare
> optimization to use {,v}ptest assumes there are instructions for all
> 16-byte vector modes.  That isn't the case, we only have one for
> V16QI, V8HI, V4SI, V2DI, V1TI, V4SF and V2DF, not for
> V8HF nor V8BF.
>
> The following patch fixes that by using the V8HI instruction instead
> for those 2 modes.  tmp can't be a SUBREG, because it is SUBREG_REG
> of another SUBREG, so we don't need to worry about gen_lowpart
> failing.
>
> Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk?
>
> 2024-10-04  Jakub Jelinek  <ja...@redhat.com>
>
>         PR target/116921
>         * config/i386/i386-expand.cc (ix86_expand_int_compare): Add a SUBREG
>         to V8HImode from V8HFmode or V8BFmode before generating a ptest.
>
>         * gcc.target/i386/pr116921.c: New test.

OK.

Thanks,
Uros.
>
> --- gcc/config/i386/i386-expand.cc.jj   2024-10-03 17:27:28.328227793 +0200
> +++ gcc/config/i386/i386-expand.cc      2024-10-03 18:11:18.514076904 +0200
> @@ -3095,6 +3095,9 @@ ix86_expand_int_compare (enum rtx_code c
>        && GET_MODE_SIZE (GET_MODE (SUBREG_REG (op0))) == 16)
>      {
>        tmp = SUBREG_REG (op0);
> +      if (GET_MODE_INNER (GET_MODE (tmp)) == HFmode
> +         || GET_MODE_INNER (GET_MODE (tmp)) == BFmode)
> +       tmp = gen_lowpart (V8HImode, tmp);
>        tmp = gen_rtx_UNSPEC (CCZmode, gen_rtvec (2, tmp, tmp), UNSPEC_PTEST);
>      }
>    else
> --- gcc/testsuite/gcc.target/i386/pr116921.c.jj 2024-10-03 18:16:36.368711747 
> +0200
> +++ gcc/testsuite/gcc.target/i386/pr116921.c    2024-10-03 18:17:25.702034243 
> +0200
> @@ -0,0 +1,12 @@
> +/* PR target/116921 */
> +/* { dg-do compile { target int128 } } */
> +/* { dg-options "-O2 -msse4" } */
> +
> +long x;
> +_Float16 __attribute__((__vector_size__ (16))) f;
> +
> +void
> +foo (void)
> +{
> +  x -= !(__int128) (f / 2);
> +}
>
>         Jakub
>

Reply via email to