CC: richard.sandiford; gcc-patches
Subject: Re: Re: [PATCH V2] VECT: Support CALL vectorization for COND_LEN_*
On Wed, 2 Aug 2023, juzhe.zh...@rivai.ai wrote:
> Thank you so much, Richard.
>
> Forgive me for asking another question :)
>
> Does the following code look correct to you?
Well, I wond
+ else if (reduc_idx >= 0)
> + gcc_unreachable ();
> +}
>
> Thanks.
>
>
> juzhe.zh...@rivai.ai
>
> From: Richard Biener
> Date: 2023-08-02 15:49
> To: 钟居哲
> CC: richard.sandiford; gcc-patches
> Subject: Re: Re: [PATCH V2] VECT: Support CALL vectoriz
s += 2;
+ }
+ else if (reduc_idx >= 0)
+ gcc_unreachable ();
+}
Thanks.
juzhe.zh...@rivai.ai
From: Richard Biener
Date: 2023-08-02 15:49
To: 钟居哲
CC: richard.sandiford; gcc-patches
Subject: Re: Re: [PATCH V2] VECT: Support CALL vectorization for COND_LEN_*
On Mon, 31 Jul 2023,
18 : 0.0;
>
> with -ffast-math (probably rightfully so). I then get .FMAs
> vectorized and .COND_FMA folded.
>
> > What I wonder is whether this condition:
> >
> > if (mask_opno >= 0 && reduc_idx >= 0)
> >
> > or similar as len
> >
>= 0)
>
> or similar as len
> if (len_opno >= 0 && reduc_idx >= 0)
>
> Whether they are redundant in vectorizable_call?
>
>
> juzhe.zh...@rivai.ai
>
> From: Richard Biener
> Date: 2023-07-31 21:33
> To: juzhe.zh...@rivai.ai
> CC: richard.sandiford; gcc-patches
> Subject: Re: Re: [PATCH V2] VECT: Support CALL vectorization for COND_LEN_*
> On Mon, 31 Jul 2023, juzhe.zh...@rivai.ai wrote:
>
0)
Whether they are redundant in vectorizable_call?
juzhe.zh...@rivai.ai
From: Richard Biener
Date: 2023-07-31 21:33
To: juzhe.zh...@rivai.ai
CC: richard.sandiford; gcc-patches
Subject: Re: Re: [PATCH V2] VECT: Support CALL vectorization for COND_LEN_*
On Mon, 31 Jul 2023, juzhe.zh...@rivai.ai wrote:
>
-07-31 20:00
> To: juzhe.zh...@rivai.ai
> CC: richard.sandiford; gcc-patches
> Subject: Re: Re: [PATCH V2] VECT: Support CALL vectorization for COND_LEN_*
> On Mon, 31 Jul 2023, juzhe.zh...@rivai.ai wrote:
>
> > Ok . Thanks Richard.
> >
> > Could you give me a case
ate: 2023-07-31 20:00
To: juzhe.zh...@rivai.ai
CC: richard.sandiford; gcc-patches
Subject: Re: Re: [PATCH V2] VECT: Support CALL vectorization for COND_LEN_*
On Mon, 31 Jul 2023, juzhe.zh...@rivai.ai wrote:
> OK. Thanks, Richard.
>
> Could you give me a case where SVE can vectorize a reduction with FMA?
> That is, one that goes into vectorizable_call and vectorizes FMA into COND_FMA?
>
> I tried many times to reproduce such a case but failed.
I think you n
OK. Thanks, Richard.
Could you give me a case where SVE can vectorize a reduction with FMA?
That is, one that goes into vectorizable_call and vectorizes FMA into COND_FMA?
I tried many times to reproduce such a case but failed.
Thanks.
juzhe.zh...@rivai.ai
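The kind of loop being asked about can be sketched in C as below. This is a hypothetical illustration, not the test case from the thread: the function name cond_fma_reduc and its parameters are made up here, and the sketch does not assert that any particular GCC version or target vectorizes it through vectorizable_call into .COND_FMA. The reply quoted earlier mentions -ffast-math, so that flag is probably worth trying when experimenting.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical example: a conditional multiply-add reduction.
   The accumulation "sum += a[i] * b[i]" is a candidate for FMA
   contraction, and the "if" makes the update conditional, which is
   the shape a masked (COND_*) operation would need to handle.  */
double
cond_fma_reduc (const double *a, const double *b, const int *cond, size_t n)
{
  /* Basic sanity check on the inputs (only meaningful when n > 0).  */
  assert (n == 0 || (a && b && cond));

  double sum = 0.0;
  for (size_t i = 0; i < n; i++)
    if (cond[i])
      sum += a[i] * b[i];
  return sum;
}
```

Compiling with the vectorizer's dump flags (for example -fdump-tree-vect-details) is one way to check which internal functions, if any, the loop ends up using on a given target.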
From: Richard Sandiford
Date: 2023-07-31 1
...@rivai.ai
From: Richard Biener
Date: 2023-07-31 18:45
To: juzhe.zh...@rivai.ai
CC: gcc-patches; richard.sandiford
Subject: Re: Re: [PATCH V2] VECT: Support CALL vectorization for COND_LEN_*
On Mon, 31 Jul 2023, juzhe.zh...@rivai.ai wrote:
> Hi, Richard. Thanks a lot for the comment.
>
> >> In your code above
> >> you either use cond_len_fn or get_len_internal_fn (cond_fn) but
> >> isn't that the very same?! So how come you in one case add two
> >> and in the other add four args?
>
Hi, Richard. Thanks a lot for the comment.
>> In your code above
>> you either use cond_len_fn or get_len_internal_fn (cond_fn) but
>> isn't that the very same?! So how come you in one case add two
>> and in the other add four args?
cond_len_fn is not the same as get_len_internal_fn (cond_fn) whe