[Mesa-dev] [PATCH] [V2] i965: Context aware user space EU control through application

2018-08-13 Thread aravindan . muthukumar
rs can read meaning of high/medium/low for their platform & then program contexts accordingly. Here gpu_load_type directly translates to number of shader cores/EUs a particular GPU has. V2 : changes for setting the loadtype through setparam. Signed-off-by: Aravindan Muthukumar Signed-

[Mesa-dev] [PATCH 2/2] i965: Context aware user space EU control through application

2018-07-20 Thread aravindan . muthukumar
rs can read meaning of high/medium/low for their platform & then program contexts accordingly. Here gpu_load_type directly translates to number of shader cores/EUs a particular GPU has. Signed-off-by: Aravindan Muthukumar Signed-off-by: Kedar J Karanje Signed-off-by: Praveen

[Mesa-dev] [PATCH 1/2] RFC : Context aware user space Resource control

2018-07-20 Thread aravindan . muthukumar
ntexts accordingly. Here, gpu_load_type directly translates to number of shader cores/EUs a particular GPU has. Signed-off-by: Aravindan Muthukumar Signed-off-by: Kedar J Karanje Signed-off-by: Praveen Diwakar Signed-off-by: Yogesh Marathe --- include/EGL/eglext.h

[Mesa-dev] [PATCH v4] i965 : optimized bucket index calculation

2017-11-08 Thread aravindan . muthukumar
From: Aravindan Muthukumar Reducing Bucket index calculation to O(1). This algorithm calculates the index using matrix method. Matrix arrangement is as below: Assuming PAGE_SIZE is 4096. 1*4096 2*40963*40964*4096 5*4096 6*40967*40968*4096 10

[Mesa-dev] [PATCH v3] i965 : optimized bucket index calculation

2017-11-06 Thread aravindan . muthukumar
From: Aravindan Muthukumar Now the complexity has been reduced to O(1) Algorithm calculates the index using matrix method. Matrix arrangement is as below: Assuming PAGE_SIZE is 4096. 1*4096 2*40963*40964*4096 5*4096 6*40967*40968*4096 10*4096

[Mesa-dev] [PATCH v3] i965 : optimized bucket index calculation.

2017-10-26 Thread aravindan . muthukumar
From: Aravindan Muthukumar Avoiding the loop which was running with O(n) complexity. Now the complexity has been reduced to O(1) Algorithm calculates the index using matrix method. Matrix arrangement is as below: Assuming PAGE_SIZE is 4096. 1*4096 2*40963*40964*4096

[Mesa-dev] [PATCH v2] i965 : optimized bucket index calculation

2017-09-13 Thread aravindan . muthukumar
From: Aravindan Muthukumar Avoiding the loop which was running with O(n) complexity. Now the complexity has been reduced to O(1) Algorithm calculates the index using matrix method. Matrix arrangement is as below: Assuming PAGE_SIZE is 4096. 1*4096 2*40963*40964*4096

[Mesa-dev] [PATCH] i965 : optimized bucket index calculation

2017-09-08 Thread aravindan . muthukumar
From: Aravindan Muthukumar Avoiding the loop which was running with O(n) complexity. Now the complexity has been reduced to O(1) Tested with piglit. Slight performance improvement (~1%) in 3d mark. Change-Id: Id099f1cd24ad5b691a69070eda79b8f4e9be39a6 Signed-off-by: Aravindan Muthukumar Signed

[Mesa-dev] [PATCH] i965: Avoids loop for buffer object availability in add_exec_bo

2017-07-28 Thread aravindan . muthukumar
From: Aravindan Muthukumar Original logic loops over the list for every buffer object. Maintained a flag to identify whether bo is already there in list. Improves performance - 3DMark by 2% Tested with piglit Signed-off-by: Aravindan Muthukumar Signed-off-by: Yogesh Marathe --- src/mesa

[Mesa-dev] [PATCH V3] i965 : Optimize atom state flag checks

2017-07-20 Thread aravindan . muthukumar
From: Aravindan Muthukumar This patch improves CPI Rate(Cycles per Instruction) and branch miss predict for i965. The function check_state() was showing CPI retired rate. Performance stats with android: - CPI retired lowered by 28% (lower is better) - Branch missprediction lowered by 13% (lower

[Mesa-dev] [PATCH V2] i965 : Optimize atom state flag checks

2017-07-20 Thread aravindan . muthukumar
From: Aravindan Muthukumar This patch improves CPI Rate(Cycles per Instruction) and branch mispredict for i965. The function check_state() was showing CPI retired rate. Performance stats with android: CPI retired lowered by 28% (lower is better) Branch missprediction lowered by 13% (lower is

[Mesa-dev] [PATCH] i965 : Performance Improvement

2017-07-13 Thread aravindan . muthukumar
From: Aravindan M This patch improves CPI Rate(Cycles per Instruction) and CPU time utilization for i965. The functions check_state and brw_pipeline_state_finished was found poor CPU utilization from performance analysis. Change-Id: I17c7e719a16e222764217a0e67b4482748537b67 Signed-off-by: Aravin

[Mesa-dev] [PATCH] i965 : Performance Improvement

2017-07-13 Thread aravindan . muthukumar
From: Aravindan M This patch improves CPI Rate(Cycles per Instruction) and CPU time utilization for i965. The functions check_state and brw_pipeline_state_finished was found poor CPU utilization from performance analysis. Change-Id: I17c7e719a16e222764217a0e67b4482748537b67 Signed-off-by: Aravin