This approach is generally right but implemented in the wrong place.
This "lowerPOW" happens pre-SSA. What actually needs to happen is that
this type of optimization is done at SSA time as part of
ConstantFolding. And the fallback for POW should be implemented as
part of the "legalize" step. That w
On Thu, Jun 22, 2017 at 9:44 PM, Roland Scheidegger wrote:
> On 22.06.2017 at 21:28, Karol Herbst wrote:
>> If the exponent is a small integer immediate value, we can lower POW to
>> MULs instead to save a few instructions. Also MUL instructions execute
>> faster than what we lower POW in the def
On 22.06.2017 at 21:28, Karol Herbst wrote:
> If the exponent is a small integer immediate value, we can lower POW to
> MULs instead to save a few instructions. Also MUL instructions execute
> faster than what we lower POW in the default case to.
>
> score change for GpuTest /test=pixmark_piano /
If the exponent is a small integer immediate value, we can lower POW to
MULs instead to save a few instructions. Also MUL instructions execute
faster than the sequence we lower POW to in the default case.
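
To illustrate the idea described above, here is a minimal standalone sketch of
expanding x^n into multiplications for a small non-negative integer n. It is
not the patch itself and does not use the nouveau codegen API: the names
PowResult and powByMuls, the square-and-multiply strategy, and the cutoff of 16
are assumptions made for this example only.

```cpp
#include <cassert>

// Hypothetical result type for this sketch: the computed value plus the
// number of multiplications that were needed to produce it.
struct PowResult {
    float value;
    int   muls;
};

// Expand x^n into multiplications using square-and-multiply.
// Assumption: n is a small immediate; 16 is an arbitrary cutoff for this
// sketch, not a value taken from the patch.
PowResult powByMuls(float x, unsigned n)
{
    assert(n <= 16 && "sketch only handles small exponents");

    PowResult r{1.0f, 0};
    if (n == 0)
        return r;                 // x^0 == 1, no multiplication needed

    float base = x;               // holds x^(2^k) for the current bit k
    bool haveResult = false;      // avoids counting a multiply by 1.0

    while (n) {
        if (n & 1) {
            if (haveResult) {
                r.value *= base;  // fold this power of two into the result
                ++r.muls;
            } else {
                r.value = base;   // first set bit: a copy, not a multiply
                haveResult = true;
            }
        }
        n >>= 1;
        if (n) {
            base *= base;         // square for the next bit
            ++r.muls;
        }
    }
    return r;
}
```

With this scheme x^2 needs one multiplication, and x^3 and x^4 need two each.
Whether the expansion is worthwhile for larger exponents depends on how many
instructions the default POW lowering produces, which this sketch does not
model.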
score change for GpuTest /test=pixmark_piano /benchmark /no_scorebox
/msaa=0 /benchmark_duration_ms=6