Re: glxsync - explicit frame synchronization sample implementation

Michael Clark Thu, 30 Dec 2021 16:31:30 -0800

On 30/12/21 6:20 pm, Michael Clark wrote:

Dear Mesa Developers,
I have been using GLFW for tiny cross-platform OpenGL demos for some time, but one thing that has really been bothering me is the visual artifacts when resizing windows. Over the last year or so I have made multiple attempts at solving this issue, digging progressively deeper each time, until spending the last month researching compositor synchronization protocols, reading compositor code, and writing this demo as a prelude to figuring out how one might fix this issue in GLFW or even Chrome.
I decided that first it might be a good idea to come up with the simplest possible isolated example comprising a near-complete solution without the unnecessary complexity of layering for all of the cross-platform abstractions. It seems to me that, despite the ease with which this can be solved with Wayland EGL, it is still useful, primarily for wider compatibility, to be able to package X11 GLX applications, X11 being the window system that I typically use when targeting Linux with GLFW.
That brings me to _glxsync_ which is an attempt at creating a minimally correct implementation of explicit frame synchronization using X11, GLX, XSync and the latest compositor synchronization protocols [1,2], tested to work with mutter and GNOME on Xorg or Xwayland.
- https://github.com/michaeljclark/glxsync/
_glxsync_ is an X Windows OpenGL demo app using GLX and XSync extended frame synchronization responding to synchronization requests from the compositor in response to configuration changes for window resizes. The demo updates extended synchronization counters before and after frames to signal to the compositor that rendering is in progress so that buffers read by the compositor are complete and match the size in configuration change events. It also has rudimentary congestion control.
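As a rough illustration of the "before and after frames" counter updates, here is a minimal sketch of the parity convention as I read it in [1]: an odd counter value means a frame is being drawn, an even value means the frame is complete. The helper names are hypothetical and are not taken from the glxsync source; the counter is modelled as a plain 64-bit integer so the logic can be checked without an X server.

```c
#include <stdint.h>

/* Hypothetical helpers, not from glxsync. In a real client the returned
 * value would be converted with XSyncIntsToValue() and published to the
 * compositor with XSyncSetCounter(). */

/* Frame begins: move to an odd value ("drawing in progress"). */
static int64_t frame_begin(int64_t counter)
{
    return (counter % 2 == 0) ? counter + 1 : counter;
}

/* Frame ends: move to an even value ("frame complete"). */
static int64_t frame_end(int64_t counter)
{
    return (counter % 2 != 0) ? counter + 1 : counter;
}

/* Nonzero while the counter says a frame is being drawn. */
static int frame_in_progress(int64_t counter)
{
    return counter % 2 != 0;
}
```

The sketch leaves out how the value carried by a _NET_WM_SYNC_REQUEST message is folded into the counter after a resize; that part is best taken from [1] directly.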
_glxsync_ depends on the following X11 window system atoms:

- _NET_WM_SYNC_REQUEST
- _NET_WM_SYNC_REQUEST_COUNTER
- _NET_WM_FRAME_DRAWN
- _NET_WM_FRAME_TIMINGS
- _NET_WM_PING

_glxsync_ *does not* yet implement the following extensions:

- _NET_WM_SYNC_FENCES
- _NET_WM_MOVERESIZE

_glxsync_ depends on the following libraries: _X11, Xext, GLX, GL_.
I have to say there were numerous subtle issues that I found while testing this code on Ubuntu 21.10 XWayland with an Intel Mesa graphics stack and Ubuntu 20.04 LTS Xorg with the NVIDIA proprietary graphics stack, so I have no idea how it will fly with other drivers and am very interested in feedback. There really is not much sample code that I could find that addresses this issue.
I found the Intel driver particularly finicky and there are some very carefully placed XFlush calls *before* frame renders, and XSync calls during congestion. There are also the beginnings of adaptive frame rate using frame times and render timings stored in a circular buffer. That said, there is no advanced adaptive frame rate logic beyond detecting circumstances that can lead to tears with a back-off to the measured short term average frame rate from statistics, and some logic to delay frames when there are collisions with Expose events.
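To make the circular buffer idea concrete, here is a minimal sketch of a fixed-size ring of frame times with a short term average. The type name, function names and the window size of 8 are assumptions for illustration only and do not reflect the actual glxsync data structures.

```c
#include <stddef.h>
#include <stdint.h>

#define FRAME_HISTORY 8 /* hypothetical window size */

typedef struct {
    int64_t samples[FRAME_HISTORY]; /* frame times in microseconds */
    size_t  next;                   /* slot the next sample overwrites */
    size_t  count;                  /* valid samples, at most FRAME_HISTORY */
} frame_stats;

/* Record one frame time, overwriting the oldest sample once full.
 * The struct must start zero-initialized. */
static void frame_stats_add(frame_stats *s, int64_t frame_time_us)
{
    s->samples[s->next] = frame_time_us;
    s->next = (s->next + 1) % FRAME_HISTORY;
    if (s->count < FRAME_HISTORY)
        s->count++;
}

/* Mean of the recorded samples, or 0 when nothing has been recorded. */
static int64_t frame_stats_average(const frame_stats *s)
{
    int64_t sum = 0;
    if (s->count == 0)
        return 0;
    for (size_t i = 0; i < s->count; i++)
        sum += s->samples[i];
    return sum / (int64_t)s->count;
}
```

A back-off could then pace frames at the averaged interval instead of the nominal refresh interval whenever a likely tear is detected.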

I would like to add these implementation notes to the README because this is information one cannot easily find. It occurs to me that XFlush before frames makes a lot more sense than after frames if one thinks about Nagle and flow control combined with frame pacing. If we have capacity to render at a constant frame rate with accurate scheduling for the start of frames, then an XFlush(dpy) marker placed at the start of the frame will occur at a constant rate, subject to variable render times, whereas an XFlush(dpy) marker placed at the end of the frame would have irregular timings needing stats for recovery. I am guessing these are conversations that folks have already had because it seems to work on my machine. An XSync(dpy, False) marker for congestion control also seems to make sense to me because if we get frame drops we want to resynchronize input and output. I am not sure under which conditions one may wish to do XSync(dpy, True). Possibly some sort of watchdog or hang check for IO when recovering from flooding.
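The pacing argument can be sketched as follows: if frame starts are scheduled on a fixed grid, a marker issued at the start of each frame lands at a constant rate no matter how long the previous render took. The function name and parameters are hypothetical; in an Xlib client the XFlush(dpy) call would be made when the returned time is reached, just before rendering.

```c
#include <stdint.h>

/* Hypothetical scheduler, not from glxsync. Returns the first grid point
 * epoch_us + k * period_us (k >= 0) that lies strictly after now_us, or
 * epoch_us itself if the grid has not started yet. All times are in
 * microseconds and period_us must be positive. */
static int64_t next_frame_start(int64_t epoch_us, int64_t period_us,
                                int64_t now_us)
{
    if (now_us < epoch_us)
        return epoch_us;
    int64_t elapsed = now_us - epoch_us;
    return epoch_us + (elapsed / period_us + 1) * period_us;
}
```

With a 10 ms period, a render that finishes 3 ms or 9.999 ms into the interval yields the same next start time, so the start-of-frame marker stays regular, whereas an end-of-frame marker would inherit the variable render time.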

Anyway I don't know where to go for this information so I am verbalizing it to see if anyone can acknowledge it as being reasonable protocol.

There is also some rudimentary tracing infrastructure and some carefully placed calls to poll, XEventsQueued(d, QueuedAlready), XEventsQueued(d, QueuedAfterReading) to avoid blocking in XNextEvent at all costs. I found it necessary to add a heuristic to avoid frame submission until receiving frame timings from the compositor. Intuitively one might think this makes the loop synchronous, but with the NVIDIA driver, it appears the heuristic still allows multiple frames to be submitted in advance. It is certainly finicky to debug. There is a --no-sync option to simulate the absence of compositor synchronization as a testing aid.
There is very little back-pressure signaling to the client beyond the ability to observe timings and serial numbers in frame drawn and frame timing messages. It worries me that I need very careful placement of XFlush and XSync to make the demo work so I would really appreciate feedback if I am doing it wrong. There is some interesting potential for control loops when using stats for adaptive frame rate, so I have not yet attempted any sophisticated congestion control algorithm.
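The bounded wait that keeps the loop out of a blocking XNextEvent can be sketched with plain poll(). The helper name is hypothetical. In an Xlib client the descriptor would be ConnectionNumber(dpy), and XEventsQueued(dpy, QueuedAlready) would be checked first, because events that Xlib has already read into its own queue do not make the socket readable again.

```c
#include <poll.h>

/* Hypothetical helper, not from glxsync. Waits up to timeout_ms for fd to
 * become readable. Returns 1 if readable, 0 on timeout, and -1 on error
 * (including interruption by a signal). */
static int wait_readable(int fd, int timeout_ms)
{
    struct pollfd pfd = { .fd = fd, .events = POLLIN, .revents = 0 };
    int rc = poll(&pfd, 1, timeout_ms);
    if (rc < 0)
        return -1;
    return (rc > 0 && (pfd.revents & POLLIN)) ? 1 : 0;
}
```

Choosing the timeout as the time remaining until the next scheduled frame lets the same call double as the frame timer: on a return of 1 the client drains events while XEventsQueued(dpy, QueuedAfterReading) is nonzero, and on a return of 0 it renders.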
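For reading the serial numbers and timings, my understanding of [1] is that a _NET_WM_FRAME_DRAWN client message carries the frame counter in data.l[0] (low 32 bits) and data.l[1] (high 32 bits), and the frame time in data.l[2] and data.l[3] in the same way. The helper below is a hypothetical sketch of joining such a pair. The masking matters because Xlib stores each 32-bit field in a C long, which is sign-extended on 64-bit platforms.

```c
#include <stdint.h>

/* Hypothetical helper, not from glxsync. Joins the low and high 32-bit
 * halves of a value delivered in two long fields of a client message.
 * Only the low 32 bits of each argument are used. */
static uint64_t join_u32_pair(long lo, long hi)
{
    return ((uint64_t)((unsigned long)hi & 0xffffffffUL) << 32)
         |  (uint64_t)((unsigned long)lo & 0xffffffffUL);
}
```

Given an XClientMessageEvent pointer named ev (an assumed variable), the frame counter would be join_u32_pair(ev->data.l[0], ev->data.l[1]) and the frame time join_u32_pair(ev->data.l[2], ev->data.l[3]).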

I have a feeling the delays I am introducing after a collision alter the frame time offset, and recovering from that after a flood of Expose events is not something I have added to the sample. Does one stutter, or does one warp time over some period to resynchronize back to the vertical blank time offset? I implemented frame pacing but the sample does not consider the vertical blank offset yet. Interesting problem.

It occurs to me that mixing implicit and explicit frame synchronization would be a nightmare to debug. I am wondering if the use of XFlush (and maybe XSync) markers as part of the frame sync protocol for OpenGL over the GLX encapsulation is a good idea. The XFlush before each frame seemed necessary in my testing, at least for interoperability between the Mesa stack and the NVIDIA stack. nouveau and amdgpu are still unknowns.

In any case I am sharing this code with the hopes that folk can help with testing. I was thinking to make a patch for GLFW but this was a first step. I would really appreciate it if folks could help test on different drivers such as nouveau and amdgpu as I don't have access to them. The code is currently released under the PLEASE LICENSE which is practically public domain with one exception, but I am not disinclined towards releasing it under an MIT license if it were found to be a useful sample to add to the mesa demos.
Is there a place in mesa-demos for a frame synchronization demo? I see glsync. Is there a compositor sync example that I may have missed? I can imagine with the addition of WM_MOVERESIZE it could be used for tests. This is pretty much version 0.0.1, i.e. it is clean enough to release.
Regards,
Michael Clark

[1] https://fishsoup.net/misc/wm-spec-synchronization.html
[2] https://lwn.net/Articles/814587/
