On 12/31/2012 05:34 AM, Vadim Girlin wrote:
Optimize it for better performance.

Signed-off-by: Vadim Girlin <vadimgir...@gmail.com>
---

I did some benchmarking for this patch together with the patch that introduces
ISA tables, using the texCombine test (IIRC it builds hundreds of shaders) and
making the driver build each shader 5000 times, total time of the test was
reduced from 45 to 25 seconds, the time spent in check_and_set_bank_swizzle
is about 8 times less according to the oprofile, the time of
r600_shader_from_tgsi is reduced by half.

You can also find the patch in this branch:

http://cgit.freedesktop.org/~vadimg/mesa/log/?h=r600-disasm

After some other benchmarks I think that texCombine benchmark that I used is close to the worst possible case (when most of the inst groups do not use default swizzles, so we have to perform a lot of iterations to find the correct swizzles). Now with other tests I see that in some cases (where most of the alu groups just use default swizzles) this patch might even have slightly worse performance due to additional overhead. I'm going to improve the handling of the default case and gather some stats over the popular apps, and then update the patch to make it more efficient in every case.

Vadim

_______________________________________________
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev

Reply via email to