On Fri, 3 Mar 2023 at 11:29, Thomas Huth <th...@redhat.com> wrote: > > On 03/03/2023 12.18, Peter Maydell wrote: > > On Fri, 3 Mar 2023 at 09:10, Juan Quintela <quint...@redhat.com> wrote: > >> What is really weird with this failure is that: > >> - it only happens on non-x86 > > > > No, I have seen it on x86 macos, and x86 OpenBSD > > > >> - on code that is not arch dependent > >> - on cancel, what we really do there is close fd's for the multifd > >> channel threads to get out of the recv, i.e. again, nothing that > >> should be arch dependent. > > > > I'm pretty sure that it tends to happen when the machine that's > > running the test is heavily loaded. You probably have a race condition. > > I think I can second that. IIRC I've seen it a couple of times on my x86 > laptop when running "make check -j$(nproc) SPEED=slow" here.
Also, to repeat an offer I've made before, on my x86 macos laptop 'make check -j4' fails on this test about one time in two. If you write a patch to add whatever logging information you would find useful for debugging, I'm happy to do test runs with that patch and send you the logs. -- PMM