030: Unthrottle parallel jobs in reverse

Hanna Reitz Mon, 15 Nov 2021 05:56:51 -0800

On 12.11.21 17:25, Vladimir Sementsov-Ogievskiy wrote:

11.11.2021 15:08, Hanna Reitz wrote:
See the comment for why this is necessary.
Signed-off-by: Hanna Reitz <hre...@redhat.com>
---
  tests/qemu-iotests/030 | 11 ++++++++++-
  1 file changed, 10 insertions(+), 1 deletion(-)

diff --git a/tests/qemu-iotests/030 b/tests/qemu-iotests/030
index 5fb65b4bef..567bf1da67 100755
--- a/tests/qemu-iotests/030
+++ b/tests/qemu-iotests/030
@@ -251,7 +251,16 @@ class TestParallelOps(iotests.QMPTestCase):
                                   speed=1024)
              self.assert_qmp(result, 'return', {})
-        for job in pending_jobs:
+        # Do this in reverse: After unthrottling them, some jobs may finish
+        # before we have unthrottled all of them. This will drain their
+        # subgraph, and this will make jobs above them advance (despite those
+        # jobs on top being throttled). In the worst case, all jobs below the
+        # top one are finished before we can unthrottle it, and this makes it
+        # advance so far that it completes before we can unthrottle it - which
+        # results in an error.
+        # Starting from the top (i.e. in reverse) does not have this problem:
+        # When a job finishes, the ones below it are not advanced.
Hmm, interesting why only jobs above the finished job may advance in this situation...
It looks like something may change, and then this workaround will stop working.
Isn't it better to just handle the error, and not care if the job has just finished?
Something like

if result['return'] != {}:
    # Job was finished during drain caused by finish of already unthrottled job
    self.assert_qmp(result, 'error/class', 'DeviceNotActive')
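
For illustration only, here is a minimal standalone sketch of the "tolerate the error" idea above. It assumes the usual QMP reply shape (either a 'return' key or an 'error' dict with a 'class' member); the helper name unthrottle_reply_ok is hypothetical and not part of iotests. Note that it looks keys up with .get(), since an error reply has no 'return' key at all.

```python
def unthrottle_reply_ok(result):
    """Return True if a reply to the unthrottle command is acceptable.

    Hypothetical helper: accepts plain success ({'return': {}}) or a
    DeviceNotActive error, which is assumed here to mean that the job
    had already completed by the time we tried to unthrottle it.
    """
    # Success case: QMP answered with an empty return value.
    if result.get('return') == {}:
        return True
    # Error case: only the "job is already gone" class is tolerated.
    error = result.get('error', {})
    return error.get('class') == 'DeviceNotActive'
```

A test could then assert unthrottle_reply_ok(result) in the unthrottle loop instead of requiring success for every job.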

Well. My explanation (excuse) is that I felt like this was the hack-ish solution that I could have gone for from the start without understanding what the issue is (and in fact it was the solution I used while debugging the other problems). I went with `reversed()`, because that really addresses the problem.

You’re right in that it only addresses the problem for now and there’s a chance it might reappear. If we want to go with ignoring DeviceNotActive errors, then I think we should at least query all block jobs before the unthrottle loop and see that at least at one point they were all running simultaneously.
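
As a rough sketch of that check (not what the patch does): assuming a query-block-jobs reply whose 'return' list has one entry per job, with the job ID in the 'device' field, a hypothetical helper could verify that all expected jobs are listed at once. The name all_jobs_running and the job IDs in the usage note are made up for illustration.

```python
def all_jobs_running(query_reply, expected_job_ids):
    """Check that every expected job shows up in one query-block-jobs reply.

    Hypothetical helper: 'query_reply' is assumed to be the parsed QMP
    reply, i.e. a dict whose 'return' member is a list of job entries
    that carry the job ID in their 'device' field.
    """
    # Collect the IDs of all jobs that exist at the time of the query.
    listed = {job['device'] for job in query_reply.get('return', [])}
    # Every job we started must still be present simultaneously.
    return set(expected_job_ids) <= listed
```

Called once before the unthrottle loop, e.g. all_jobs_running(reply, ['job-a', 'job-b']), this would show that the jobs did run in parallel at some point, even if some later unthrottle requests hit DeviceNotActive.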

I don’t really have a strong opinion. We can exchange this patch now (though I’d rather not hold up the rest of the series for it), or have a patch on top later, or, well, just keep it for now. I think the least stressful option would be to just fix it up later.


Hanna

Re: [PATCH v2 10/10] iotests/030: Unthrottle parallel jobs in reverse
