Re: [PATCH][i386] Split reductions (was: Re: [PATCH][RFC][i386] Change sminmax reduction patterns)

2018-10-09 Thread Uros Bizjak
On Tue, Oct 9, 2018 at 3:28 PM Richard Biener wrote: > > On Mon, 8 Oct 2018, Richard Biener wrote: > > > On Fri, 5 Oct 2018, Uros Bizjak wrote: > > > > > On Thu, Oct 4, 2018 at 2:05 PM Richard Biener wrote: > > > > > > > > > > > > This tries to apply the same trick to sminmax reduction patterns >

Re: [PATCH][i386] Split reductions (was: Re: [PATCH][RFC][i386] Change sminmax reduction patterns)

2018-10-09 Thread Richard Biener
On Mon, 8 Oct 2018, Richard Biener wrote: > On Fri, 5 Oct 2018, Uros Bizjak wrote: > > > On Thu, Oct 4, 2018 at 2:05 PM Richard Biener wrote: > > > > > > > > > This tries to apply the same trick to sminmax reduction patterns > > > as for the reduc_plus_scal ones, namely reduce %zmm -> %ymm -> %x

[PATCH][i386] Split reductions (was: Re: [PATCH][RFC][i386] Change sminmax reduction patterns)

2018-10-08 Thread Richard Biener
On Fri, 5 Oct 2018, Uros Bizjak wrote: > On Thu, Oct 4, 2018 at 2:05 PM Richard Biener wrote: > > > > > > This tries to apply the same trick to sminmax reduction patterns > > as for the reduc_plus_scal ones, namely reduce %zmm -> %ymm -> %xmm > > first. On a microbenchmark this improves performa

Re: [PATCH][RFC][i386] Change sminmax reduction patterns

2018-10-05 Thread Uros Bizjak
On Thu, Oct 4, 2018 at 2:05 PM Richard Biener wrote: > > > This tries to apply the same trick to sminmax reduction patterns > as for the reduc_plus_scal ones, namely reduce %zmm -> %ymm -> %xmm > first. On a microbenchmark this improves performance on Zen > by ~30% for AVX2 and on Skylake-SP by ~

Re: [PATCH][RFC][i386] Change sminmax reduction patterns

2018-10-04 Thread Richard Biener
On Thu, 4 Oct 2018, Richard Biener wrote: > > This tries to apply the same trick to sminmax reduction patterns > as for the reduc_plus_scal ones, namely reduce %zmm -> %ymm -> %xmm > first. On a microbenchmark this improves performance on Zen > by ~30% for AVX2 and on Skylake-SP by ~10% for AVX5

[PATCH][RFC][i386] Change sminmax reduction patterns

2018-10-04 Thread Richard Biener
This tries to apply the same trick to sminmax reduction patterns as for the reduc_plus_scal ones, namely reduce %zmm -> %ymm -> %xmm first. On a microbenchmark this improves performance on Zen by ~30% for AVX2 and on Skylake-SP by ~10% for AVX512 (for AVX2 there's no measurable difference). I g