On Sat, 2014-11-01 at 23:15 +, Bruno Jimenez wrote:
> On Wed, 2014-10-29 at 23:09 +1100, Timothy Arceri wrote:
> > On Wed, 2014-10-29 at 16:58 +1100, Timothy Arceri wrote:
> > > On Tue, 2014-10-28 at 22:14 +, Bruno Jimenez wrote:
> > > > Hi,
> > > >
> > > > I haven't had time to play yet w
On Wed, 2014-10-29 at 23:09 +1100, Timothy Arceri wrote:
> On Wed, 2014-10-29 at 16:58 +1100, Timothy Arceri wrote:
> > On Tue, 2014-10-28 at 22:14 +, Bruno Jimenez wrote:
> > > Hi,
> > >
> > > I haven't had time to play yet with OpenMP, but I have seen the assembly
> > > it produces in my com
On Wed, 2014-10-29 at 16:58 +1100, Timothy Arceri wrote:
> On Tue, 2014-10-28 at 22:14 +, Bruno Jimenez wrote:
> > Hi,
> >
> > I haven't had time to play yet with OpenMP, but I have seen the assembly
> > it produces in my computer. If I enable SSE2 it can use it, and if I
> > enable SSE4.1 it
On Tue, 2014-10-28 at 22:14 +, Bruno Jimenez wrote:
> On Wed, 2014-10-29 at 07:49 +1100, Timothy Arceri wrote:
> > On Mon, 2014-10-27 at 20:04 +, Bruno Jimenez wrote:
> > > [snip]
> > > > >> +
> > > > >> + if (aligned_count >= 4) {
> > > > > ^^
> > > > >
> > > >
On Wed, 2014-10-29 at 07:49 +1100, Timothy Arceri wrote:
> On Mon, 2014-10-27 at 20:04 +, Bruno Jimenez wrote:
> > [snip]
> > > >> +
> > > >> + if (aligned_count >= 4) {
> > > > ^^
> > > >
> > > > Hi,
> > > >
> > > > I have been thinking and I think that you can ch
On Mon, 2014-10-27 at 20:04 +, Bruno Jimenez wrote:
> [snip]
> > >> +
> > >> + if (aligned_count >= 4) {
> > > ^^
> > >
> > > Hi,
> > >
> > > I have been thinking and I think that you can change that 4 for an 8. In
> > > the case aligned_count == 4 there's no gain
[snip]
> >> +
> >> + if (aligned_count >= 4) {
> > ^^
> >
> > Hi,
> >
> > I have been thinking and I think that you can change that 4 for an 8. In
> > the case aligned_count == 4 there's no gain in using SSE, as you will
> > have to do a final reduction from 4 elements
On 10/27/2014 08:04 AM, Bruno Jimenez wrote:
> On Mon, 2014-10-27 at 08:37 +1100, Timothy Arceri wrote:
>> Makes use of SSE to speed up compute of min and max elements
>>
>> Callgrind cpu usage results from pts benchmarks:
>>
>> Openarena 0.8.8: 3.67% -> 1.03%
>> UrbanTerror: 2.36% -> 0.81%
>>
>> S
On Mon, 2014-10-27 at 08:37 +1100, Timothy Arceri wrote:
> Makes use of SSE to speed up compute of min and max elements
>
> Callgrind cpu usage results from pts benchmarks:
>
> Openarena 0.8.8: 3.67% -> 1.03%
> UrbanTerror: 2.36% -> 0.81%
>
> Signed-off-by: Timothy Arceri
> ---
> src/mesa/Andr
On 26/10/14 21:37, Timothy Arceri wrote:
> Makes use of SSE to speed up compute of min and max elements
>
> Callgrind cpu usage results from pts benchmarks:
>
> Openarena 0.8.8: 3.67% -> 1.03%
> UrbanTerror: 2.36% -> 0.81%
>
> Signed-off-by: Timothy Arceri
> ---
> src/mesa/Android.libmesa_dric
Makes use of SSE to speed up compute of min and max elements

Callgrind cpu usage results from pts benchmarks:

Openarena 0.8.8: 3.67% -> 1.03%
UrbanTerror: 2.36% -> 0.81%

Signed-off-by: Timothy Arceri
---
src/mesa/Android.libmesa_dricore.mk | 3 +-
src/mesa/Makefile.am | 3 +-
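
For readers following the review comment about changing the `aligned_count >= 4` threshold to 8, here is a minimal scalar sketch of the idea, not the code from the patch. It models a 128-bit SSE register as four 32-bit lanes in plain C so that it builds anywhere. A real version would presumably use SSE4.1 intrinsics such as `_mm_min_epu32` and `_mm_max_epu32`. The function name `minmax_u32` and the `LANES` macro are hypothetical, and alignment handling is left out.

```c
#include <stddef.h>

#define LANES 4 /* one 128-bit register holds four 32-bit values */

/* Hypothetical helper, not the function touched by the patch.
 * For count == 0 it returns min = UINT_MAX and max = 0. */
void minmax_u32(const unsigned *v, size_t count,
                unsigned *min_out, unsigned *max_out)
{
    unsigned lo = ~0u, hi = 0;
    size_t i = 0;

    /* With only one full group of 4 there is nothing to combine lane-wise:
     * the four lanes would go straight into the final reduction, which is
     * as much work as the scalar loop. So require at least two groups. */
    if (count >= 2 * LANES) {
        unsigned lane_lo[LANES], lane_hi[LANES];
        int l;

        /* Models the initial vector load. */
        for (l = 0; l < LANES; l++)
            lane_lo[l] = lane_hi[l] = v[l];

        /* Models one vector min and one vector max per group of 4. */
        for (i = LANES; i + LANES <= count; i += LANES) {
            for (l = 0; l < LANES; l++) {
                if (v[i + l] < lane_lo[l]) lane_lo[l] = v[i + l];
                if (v[i + l] > lane_hi[l]) lane_hi[l] = v[i + l];
            }
        }

        /* Final horizontal reduction from 4 lanes to one value each. */
        for (l = 0; l < LANES; l++) {
            if (lane_lo[l] < lo) lo = lane_lo[l];
            if (lane_hi[l] > hi) hi = lane_hi[l];
        }
    }

    /* Scalar tail, which is also the whole job for short inputs. */
    for (; i < count; i++) {
        if (v[i] < lo) lo = v[i];
        if (v[i] > hi) hi = v[i];
    }

    *min_out = lo;
    *max_out = hi;
}
```

The final reduction costs the same however many groups were processed, so the vector path only pays off once at least one lane-wise min/max step comes before it.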