Hi Michael,
Thanks for the input, I had forgotten to disable/undef DEBUG_PERF, which I
was using to get a rough idea of the time taken by the logic using the
processor performance counter on x86. The logic doesnt use x86intrinsic for
any other purpose, so disabling DEBUG_PERF will fix it.
I can r
On Wed, Jul 01, 2020 at 10:13:48PM +0530, hanishkvc wrote:
> v04-20200701IST2132, fbdetile
>
> Optimised Generic Detile logic to detile multiple tiles in parallel,
> logically speaking. This reduces cross check overheads to some extent
> and thus improves speed a bit. While in a hardware or multic