Re: [PATCH 2/2] Add X86_TUNE_AVX512_TWO_EPILOGUES, enable for Zen4 and Zen5

2024-11-12 Thread Richard Biener
On Mon, 11 Nov 2024, Richard Biener wrote: > The following adds X86_TUNE_AVX512_TWO_EPILOGUES tuning and directs the > vectorizer to produce both a vector AVX2 and SSE epilogue for AVX512 > vectorized loops when set. The tuning is enabled by default for Zen4 > and Zen5 where I benchmarked it to b

Re: [PATCH 2/2] Add X86_TUNE_AVX512_TWO_EPILOGUES, enable for Zen4 and Zen5

2024-11-12 Thread Richard Biener
On Tue, 12 Nov 2024, Hongtao Liu wrote: > On Mon, Nov 11, 2024 at 8:20 PM Richard Biener wrote: > > > > The following adds X86_TUNE_AVX512_TWO_EPILOGUES tuning and directs the > > vectorizer to produce both a vector AVX2 and SSE epilogue for AVX512 > > vectorized loops when set. The tuning is en

Re: [PATCH 2/2] Add X86_TUNE_AVX512_TWO_EPILOGUES, enable for Zen4 and Zen5

2024-11-11 Thread Hongtao Liu
On Mon, Nov 11, 2024 at 8:20 PM Richard Biener wrote: > > The following adds X86_TUNE_AVX512_TWO_EPILOGUES tuning and directs the > vectorizer to produce both a vector AVX2 and SSE epilogue for AVX512 > vectorized loops when set. The tuning is enabled by default for Zen4 > and Zen5 where I benchm

Re: [PATCH 2/2] Add X86_TUNE_AVX512_TWO_EPILOGUES, enable for Zen4 and Zen5

2024-11-11 Thread Richard Biener
> On 11.11.2024 at 18:09, Jan Hubicka wrote: > > >> >> The following adds X86_TUNE_AVX512_TWO_EPILOGUES tuning and directs the >> vectorizer to produce both a vector AVX2 and SSE epilogue for AVX512 >> vectorized loops when set. The tuning is enabled by default for Zen4 >> and Zen5 where

Re: [PATCH 2/2] Add X86_TUNE_AVX512_TWO_EPILOGUES, enable for Zen4 and Zen5

2024-11-11 Thread Jan Hubicka
> The following adds X86_TUNE_AVX512_TWO_EPILOGUES tuning and directs the > vectorizer to produce both a vector AVX2 and SSE epilogue for AVX512 > vectorized loops when set. The tuning is enabled by default for Zen4 > and Zen5 where I benchmarked it to be overall positive on SPEC CPU 2017 both > i