Re: [Mesa-dev] [PATCH V4] mesa: add SSE optimisation for glDrawElements

2014-11-02 Thread Timothy Arceri
On Sat, 2014-11-01 at 23:15 +, Bruno Jimenez wrote: > On Wed, 2014-10-29 at 23:09 +1100, Timothy Arceri wrote: > > On Wed, 2014-10-29 at 16:58 +1100, Timothy Arceri wrote: > > > On Tue, 2014-10-28 at 22:14 +, Bruno Jimenez wrote: > > > > Hi, > > > > > > > > I haven't had time to play yet w

Re: [Mesa-dev] [PATCH V4] mesa: add SSE optimisation for glDrawElements

2014-11-01 Thread Bruno Jimenez
On Wed, 2014-10-29 at 23:09 +1100, Timothy Arceri wrote: > On Wed, 2014-10-29 at 16:58 +1100, Timothy Arceri wrote: > > On Tue, 2014-10-28 at 22:14 +, Bruno Jimenez wrote: > > > Hi, > > > > > > I haven't had time to play yet with OpenMP, but I have seen the assembly > > > it produces in my com

Re: [Mesa-dev] [PATCH V4] mesa: add SSE optimisation for glDrawElements

2014-10-29 Thread Timothy Arceri
On Wed, 2014-10-29 at 16:58 +1100, Timothy Arceri wrote: > On Tue, 2014-10-28 at 22:14 +, Bruno Jimenez wrote: > > Hi, > > > > I haven't had time to play yet with OpenMP, but I have seen the assembly > > it produces in my computer. If I enable SSE2 it can use it, and if I > > enable SSE4.1 it

Re: [Mesa-dev] [PATCH V4] mesa: add SSE optimisation for glDrawElements

2014-10-28 Thread Timothy Arceri
On Tue, 2014-10-28 at 22:14 +, Bruno Jimenez wrote: > On Wed, 2014-10-29 at 07:49 +1100, Timothy Arceri wrote: > > On Mon, 2014-10-27 at 20:04 +, Bruno Jimenez wrote: > > > [snip] > > > > >> + > > > > >> + if (aligned_count >= 4) { > > > > > ^^ > > > > > > > > >

Re: [Mesa-dev] [PATCH V4] mesa: add SSE optimisation for glDrawElements

2014-10-28 Thread Bruno Jimenez
On Wed, 2014-10-29 at 07:49 +1100, Timothy Arceri wrote: > On Mon, 2014-10-27 at 20:04 +, Bruno Jimenez wrote: > > [snip] > > > >> + > > > >> + if (aligned_count >= 4) { > > > > ^^ > > > > > > > > Hi, > > > > > > > > I have been thinking and I think that you can ch

Re: [Mesa-dev] [PATCH V4] mesa: add SSE optimisation for glDrawElements

2014-10-28 Thread Timothy Arceri
On Mon, 2014-10-27 at 20:04 +, Bruno Jimenez wrote: > [snip] > > >> + > > >> + if (aligned_count >= 4) { > > > ^^ > > > > > > Hi, > > > > > > I have been thinking and I think that you can change that 4 for an 8. In > > > the case aligned_count == 4 there's no gain

Re: [Mesa-dev] [PATCH V4] mesa: add SSE optimisation for glDrawElements

2014-10-27 Thread Bruno Jimenez
[snip] > >> + > >> + if (aligned_count >= 4) { > > ^^ > > > > Hi, > > > > I have been thinking and I think that you can change that 4 for an 8. In > > the case aligned_count == 4 there's no gain in using SSE, as you will > > have to do a final reduction from 4 elements

Re: [Mesa-dev] [PATCH V4] mesa: add SSE optimisation for glDrawElements

2014-10-27 Thread Ian Romanick
On 10/27/2014 08:04 AM, Bruno Jimenez wrote: > On Mon, 2014-10-27 at 08:37 +1100, Timothy Arceri wrote: >> Makes use of SSE to speed up compute of min and max elements >> >> Callgrind cpu usage results from pts benchmarks: >> >> Openarena 0.8.8: 3.67% -> 1.03% >> UrbanTerror: 2.36% -> 0.81% >> >> S

Re: [Mesa-dev] [PATCH V4] mesa: add SSE optimisation for glDrawElements

2014-10-27 Thread Bruno Jimenez
On Mon, 2014-10-27 at 08:37 +1100, Timothy Arceri wrote: > Makes use of SSE to speed up compute of min and max elements > > Callgrind cpu usage results from pts benchmarks: > > Openarena 0.8.8: 3.67% -> 1.03% > UrbanTerror: 2.36% -> 0.81% > > Signed-off-by: Timothy Arceri > --- > src/mesa/Andr

Re: [Mesa-dev] [PATCH V4] mesa: add SSE optimisation for glDrawElements

2014-10-26 Thread Emil Velikov
On 26/10/14 21:37, Timothy Arceri wrote: > Makes use of SSE to speed up compute of min and max elements > > Callgrind cpu usage results from pts benchmarks: > > Openarena 0.8.8: 3.67% -> 1.03% > UrbanTerror: 2.36% -> 0.81% > > Signed-off-by: Timothy Arceri > --- > src/mesa/Android.libmesa_dric

[Mesa-dev] [PATCH V4] mesa: add SSE optimisation for glDrawElements

2014-10-26 Thread Timothy Arceri
Makes use of SSE to speed up compute of min and max elements Callgrind cpu usage results from pts benchmarks: Openarena 0.8.8: 3.67% -> 1.03% UrbanTerror: 2.36% -> 0.81% Signed-off-by: Timothy Arceri --- src/mesa/Android.libmesa_dricore.mk | 3 +- src/mesa/Makefile.am| 3 +-