Hi Tvrtko,

On 25/03/25 06:57, Tvrtko Ursulin wrote:

On 24/03/2025 23:17, Maíra Canal wrote:
Hi Tvrtko,

Thanks for this patchset! I applied this patchset to the RPi downstream
kernel 6.13.7 [1] and saw an FPS improvement of approximately 5.85%
with "vkgears -present-mailbox" on the RPi 5.

I did five 100 seconds runs on each kernel and here are my results:

### 6.13.7

|   Run    |   Min FPS   |   Max FPS   |   Avg FPS   |
|----------|-------------|-------------|-------------|
| Run #1   | 6646.52     | 6874.77     | 6739.313    |
| Run #2   | 5387.04     | 6723.274    | 6046.773    |
| Run #3   | 6230.49     | 6823.47     | 6423.923    |
| Run #4   | 5269.678    | 5870.59     | 5501.858    |
| Run #5   | 5504.54     | 6285.91     | 5859.724    |

* Overall Avg FPS: 6114.318 FPS


### 6.13.7 + DRM Syncobj optimisations

|   Run    |   Min FPS   |   Max FPS   |   Avg FPS   |
|----------|-------------|-------------|-------------|
| Run #1   | 6089.05     | 7296.27     | 6859.724    |
| Run #2   | 6022.48     | 7264        | 6818.518    |
| Run #3   | 5987.68     | 6188.77     | 6041.365    |
| Run #4   | 5699.27     | 6448.99     | 6190.374    |
| Run #5   | 6199.27     | 6791.15     | 6450.900    |

* Overall Avg FPS: 6472.176 FPS

Neat, thanks for testing! I am not surprised a slower CPU benefits more.

Btw if you have the raw data it would be nice to feed it to ministat too.

I ran again and collected the raw data. Here is the ministat:

x no-optimizations.txt
+ syncobjs-optimizations.txt
+---------------------------------------------------------------------------+
| + + | | + + ++ | | x + + ++ | | xx * ++x ++ | | * xx +*+ +*x++ ++ | | x ++x *+xxx +*+ x+*x+*x x ++ x | |x xxx ++xxxx *+xxx +*+ x***+** x + ++ ** + + x++ | |xxxxx x +***x*x*+**x xxxx* xx+** *******x* x + +++x**x+*+ + **++x xxx x| | |__________|______A_M____MA__________|___| |
+---------------------------------------------------------------------------+
    N           Min           Max        Median           Avg        Stddev
x  95      5660.033      7371.548      6413.172     6383.4326     431.10036
+  95      5914.994      7209.361      6538.192     6568.3293      345.7754
Difference at 95.0% confidence
        184.897 +/- 111.131
        2.89651% +/- 1.74093%
        (Student's t, pooled s = 390.774)

Best Regards,
- Maíra


Regards,

Tvrtko

[1] https://github.com/raspberrypi/linux/tree/rpi-6.13.y

Best Regards,
- Maíra

On 18/03/25 12:54, Tvrtko Ursulin wrote:
A small set of drm_syncobj optimisations which should make things a tiny bit
more efficient on the CPU side of things.

Improvement seems to be around 1.5%* more FPS if observed with "vkgears
-present-mailbox" on a Steam Deck Plasma desktop, but I am reluctant to make a definitive claim on the numbers since there is some run to run variance. But, as suggested by Michel Dänzer, I did do a five ~100 second runs on the each kernel
to be able to show the ministat analysis.

x before
+ after
+------------------------------------------------------------+
|                          x         +                       |
|                   x      x         +                       |
|                   x      xx      ++++                      |
|                 x x      xx x    ++++                      |
|                 x xx   x xx x+   ++++                      |
|                xxxxx   xxxxxx+   ++++ + +                  |
|                xxxxxxx xxxxxx+x  ++++ +++                  |
|              x xxxxxxxxxxx*xx+* x++++++++   ++             |
|        x x   xxxxxxxxxxxx**x*+*+*++++++++ ++++ +           |
|       xx x   xxxxxxxxxx*x****+***+**+++++ ++++++           |
|x     xxx x   xxxxx*x****x***********+*++**+++++++   +  +  +|
|               |_______A______|                             |
|                             |______A_______|               |
+------------------------------------------------------------+
     N           Min           Max        Median           Avg Stddev
x 135      21697.58     22809.467     22321.396     22307.707 198.75011
+ 118     22200.746      23277.09       22661.4     22671.442 192.10609
Difference at 95.0% confidence
     363.735 +/- 48.3345
     1.63054% +/- 0.216672%
     (Student's t, pooled s = 195.681)

Tvrtko Ursulin (7):
   drm/syncobj: Remove unhelpful helper
   drm/syncobj: Do not allocate an array to store zeros when waiting
   drm/syncobj: Avoid one temporary allocation in drm_syncobj_array_find
   drm/syncobj: Use put_user in drm_syncobj_query_ioctl
   drm/syncobj: Avoid temporary allocation in
     drm_syncobj_timeline_signal_ioctl
   drm/syncobj: Add a fast path to drm_syncobj_array_wait_timeout
   drm/syncobj: Add a fast path to drm_syncobj_array_find

  drivers/gpu/drm/drm_syncobj.c | 281 ++++++++++++++++++----------------
  1 file changed, 147 insertions(+), 134 deletions(-)




Reply via email to