> Should we wait for, e.g., five clean CI runs in a row? Historically, flaky tests have been a real issue for the project, and CI success probably shouldn't be taken instantaneously for releases.
There are tickets for flaky tests that have been intentionally pushed to fixVersion 4.0-rc, which makes this difficult to achieve. A green run will be a huge achievement for the project, something we haven't seen in a very long time. My understanding of the "we see one clean CI run" position was that it is a stake in the ground, set knowing (and expecting) that the situation will improve, with the ongoing work by many, on the way to GA. Might we instead apply the criteria to one of the rc releases before GA?