gharris1727 opened a new pull request #9040:
URL: https://github.com/apache/kafka/pull/9040


   Signed-off-by: Greg Harris <gr...@confluent.io>
   
   Currently, the system tests `connect_distributed_test` and 
`connect_rest_test` only wait for the REST API to come up.
   The startup of the worker includes an asynchronous process for joining the 
worker group and syncing with other workers.
   In some situations this sync takes an unusually long time, 
and the test continues before all workers have joined the group.
   This leads to flaky test failures: without an explicit wait, worker joins 
are not given sufficient time to time out and retry.
   
   This changes the ConnectDistributedTest to wait for the `Joined group` 
message to be printed to the logs before continuing with tests. I've activated 
this behavior by default, as it's a superset of the checks that were performed 
by default before.
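
   As a rough illustration of the idea only (this is not the code in this PR), 
waiting for a log message can be sketched in plain Python as below. The helper 
name `wait_for_log_line`, its parameters, and the log path in the usage note 
are hypothetical; the actual system tests run against remote worker nodes 
rather than a local file.

```python
import re
import time


def wait_for_log_line(path, pattern, timeout_sec=60.0, poll_sec=0.5):
    """Return True once `pattern` matches a line in the file at `path`.

    Returns False if no line matches before `timeout_sec` elapses.
    Hypothetical helper for illustration; not part of the Kafka test suite.
    """
    regex = re.compile(pattern)
    deadline = time.monotonic() + timeout_sec
    while True:
        try:
            # Re-read the whole file on each poll; fine for a sketch,
            # wasteful for very large logs.
            with open(path, "r", errors="replace") as log:
                if any(regex.search(line) for line in log):
                    return True
        except FileNotFoundError:
            # The worker may not have created its log file yet.
            pass
        if time.monotonic() >= deadline:
            return False
        time.sleep(poll_sec)
```

   A caller would then gate the rest of the test on something like 
`wait_for_log_line("/path/to/connect.log", r"Joined group")` (path is a 
placeholder) and fail the startup if it returns `False`.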
   
   This log message is present in every version of DistributedHerder that I 
could find, in slightly different forms, but always with `Joined group` at the 
beginning of the log message. This change should be safe to backport to any 
branch.
   
   ### Committer Checklist (excluded from commit message)
   - [ ] Verify design and implementation 
   - [ ] Verify test coverage and CI build status
   - [ ] Verify documentation (including upgrade notes)
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

