On Mon, 11 Nov 2024, Richard Biener wrote:
> The following adds X86_TUNE_AVX512_TWO_EPILOGUES tuning and directs the
> vectorizer to produce both a vector AVX2 and SSE epilogue for AVX512
> vectorized loops when set. The tuning is enabled by default for Zen4
> and Zen5 where I benchmarked it to b
On Tue, 12 Nov 2024, Hongtao Liu wrote:
> On Mon, Nov 11, 2024 at 8:20 PM Richard Biener wrote:
> >
> > The following adds X86_TUNE_AVX512_TWO_EPILOGUES tuning and directs the
> > vectorizer to produce both a vector AVX2 and SSE epilogue for AVX512
> > vectorized loops when set. The tuning is en
On Mon, Nov 11, 2024 at 8:20 PM Richard Biener wrote:
>
> The following adds X86_TUNE_AVX512_TWO_EPILOGUES tuning and directs the
> vectorizer to produce both a vector AVX2 and SSE epilogue for AVX512
> vectorized loops when set. The tuning is enabled by default for Zen4
> and Zen5 where I benchm
> Am 11.11.2024 um 18:09 schrieb Jan Hubicka :
>
>
>>
>> The following adds X86_TUNE_AVX512_TWO_EPILOGUES tuning and directs the
>> vectorizer to produce both a vector AVX2 and SSE epilogue for AVX512
>> vectorized loops when set. The tuning is enabled by default for Zen4
>> and Zen5 where
> The following adds X86_TUNE_AVX512_TWO_EPILOGUES tuning and directs the
> vectorizer to produce both a vector AVX2 and SSE epilogue for AVX512
> vectorized loops when set. The tuning is enabled by default for Zen4
> and Zen5 where I benchmarked it to be overall positive on SPEC CPU 2017 both
> i