Dear Pulsar community members,

In order to improve our CI, we will have to fix the flaky tests. In some
cases it might be necessary to replace an existing test with a redesigned
test.

The draft PIP document "Changes to flaky test handling"
<https://docs.google.com/document/d/10lmn4pW1IsT_8D1ZE0vMjASX0HhjdGdjB794iyScwns/edit?usp=sharing>
lists the top 10 flaky tests. Many of them have already been addressed by pull
requests in the past week or so.

This is the list of recent PRs that fix flaky tests from the top 10 flaky
tests list:
https://github.com/apache/pulsar/pull/9286
https://github.com/apache/pulsar/pull/9243
https://github.com/apache/pulsar/pull/9258
https://github.com/apache/pulsar/pull/9356

These are the GH issues for the remaining ones in the top 10 flaky tests
list:
https://github.com/apache/pulsar/issues/6368
https://github.com/apache/pulsar/issues/9369
https://github.com/apache/pulsar/issues/9368

If you would like to help fix flaky tests, you can pick one of the open
issues above. Just add a comment on the issue when you start working on it
so that we can coordinate activities.

It is also helpful to report a flaky test when you encounter one. I've been
using this type of template for reporting a flaky test:
https://gist.github.com/lhotari/a5c67359b362b4f3d8729330d65a2298 . The
issues #9368 and #9369 have been reported using this template.
Please search the existing issues for the test name before reporting so
that we don't end up with duplicates.

The issues #6368, #9369 and #9368 are the next 3 most important issues to fix.
I'm planning to create a more extensive list of the flaky failures so that
we can target the flakiest ones when we continue fixing the flaky tests.
I have some scripts in development to assist in mining the Pulsar GitHub
Actions workflow run logs.

This search lists the open issues about flaky tests in the Pulsar GitHub issues:
https://github.com/apache/pulsar/issues?q=flaky+sort%3Aupdated-desc+is%3Aopen
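
To check for an existing report of a specific test before filing a new one,
the same search can be narrowed by test name. The snippet below is only a
minimal sketch: the function name flaky_issue_search_url and the test name
SomeFlakyTest are made up for illustration, and it simply builds a URL with
the same query terms as the link above.

```python
from urllib.parse import urlencode

# Base URL of the Pulsar issue tracker (taken from the search link above).
ISSUES_URL = "https://github.com/apache/pulsar/issues"


def flaky_issue_search_url(test_name=None):
    """Build a GitHub issue search URL for open flaky test issues.

    test_name is optional; when given, it is added as an extra search term.
    This helper is hypothetical and not part of any Pulsar tooling.
    """
    terms = ["flaky"]
    if test_name:
        terms.append(test_name)
    terms += ["sort:updated-desc", "is:open"]
    # urlencode turns spaces into '+' and ':' into '%3A'.
    return ISSUES_URL + "?" + urlencode({"q": " ".join(terms)})


if __name__ == "__main__":
    # "SomeFlakyTest" is a placeholder; substitute the failing test's name.
    print(flaky_issue_search_url("SomeFlakyTest"))
```

Called without a test name, the function returns the general search link
shown above.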

Looking forward to the contributions for fixing flaky tests,

BR,

Lari
