Am 31.12.2015 um 19:41 schrieb Roland Scheidegger:
> Am 31.12.2015 um 10:15 schrieb Oded Gabbay:
>> On Thu, Dec 31, 2015 at 4:13 AM, Roland Scheidegger
>> wrote:
>>>
>>> Am 30.12.2015 um 10:59 schrieb Oded Gabbay:
On Wed, Dec 30, 2015 at 1:17 AM, Roland Scheidegger
wrote:
> The id
Am 31.12.2015 um 10:15 schrieb Oded Gabbay:
> On Thu, Dec 31, 2015 at 4:13 AM, Roland Scheidegger
> wrote:
>>
>> Am 30.12.2015 um 10:59 schrieb Oded Gabbay:
>>> On Wed, Dec 30, 2015 at 1:17 AM, Roland Scheidegger
>>> wrote:
The idea looks right to me.
Though frankly I don't like our c
On Thu, Dec 31, 2015 at 4:13 AM, Roland Scheidegger wrote:
>
> Am 30.12.2015 um 10:59 schrieb Oded Gabbay:
> > On Wed, Dec 30, 2015 at 1:17 AM, Roland Scheidegger
> > wrote:
> >> The idea looks right to me.
> >> Though frankly I don't like our current setup code too much - in
> >> particular the
Am 30.12.2015 um 10:59 schrieb Oded Gabbay:
> On Wed, Dec 30, 2015 at 1:17 AM, Roland Scheidegger
> wrote:
>> The idea looks right to me.
>> Though frankly I don't like our current setup code too much - in
>> particular the mix between c, assembly, and jit code, with some
>> duplication (plus the
On Wed, Dec 30, 2015 at 1:17 AM, Roland Scheidegger wrote:
> The idea looks right to me.
> Though frankly I don't like our current setup code too much - in
> particular the mix between c, assembly, and jit code, with some
> duplication (plus the lots of transpose everywhere). There's likely
> opti
The idea looks right to me.
Though frankly I don't like our current setup code too much - in
particular the mix between c, assembly, and jit code, with some
duplication (plus the lots of transpose everywhere). There's likely
optimization potential to be found there.
Roland
Am 29.12.2015 um 17:12
This patch converts the SSE optimization done in do_triangle_ccw to
VMX/VSX.
I measured the results on POWER8 machine with 32 cores at 3.4GHz and
16GB of RAM.
FPS/Score
NameBefore AfterDelta
glmark2 (scor