[ https://issues.apache.org/jira/browse/KUDU-3266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17307221#comment-17307221 ]
ASF subversion and git services commented on KUDU-3266: ------------------------------------------------------- Commit ac09d6205486908df05af4d64dc7618aae7c37bf in kudu's branch refs/heads/master from Bankim Bhavsar [ https://gitbox.apache.org/repos/asf?p=kudu.git;h=ac09d62 ] [test] KUDU-3266 Fix flakiness in dynamic_multi_master test Flakiness was reported in dynamic_multi_master test after the introduction of test for recovering dead master, commit 4b4a8c0f2f. See KUDU-3266 for the analysis. This change wraps the check for row count under ASSERT_EVENTUALLY to ensure the resumed master and the remaining master are given a chance to communicate Raft messages and become up to date. Tests: - Reproduced the issue with ASAN build with dist-test. - Verified no failures over 100 iterations with the fix on ASAN build. Change-Id: Ifac1d95707064b6ac2624d3f52336d6c39afd3c8 Reviewed-on: http://gerrit.cloudera.org:8080/17211 Tested-by: Bankim Bhavsar <ban...@cloudera.com> Reviewed-by: Andrew Wong <aw...@cloudera.com> Reviewed-by: Mahesh Reddy <mre...@cloudera.com> Reviewed-by: Alexey Serbin <aser...@cloudera.com> > Flakiness in dynamic_multi_master_test in VerifyClusterAfterMasterAddition() > function > ------------------------------------------------------------------------------------- > > Key: KUDU-3266 > URL: https://issues.apache.org/jira/browse/KUDU-3266 > Project: Kudu > Issue Type: Test > Components: master, test > Affects Versions: 1.15.0 > Reporter: Bankim Bhavsar > Assignee: Bankim Bhavsar > Priority: Major > > {noformat} > ParameterizedRecoverMasterTest.TestRecoverDeadMasterSysCatalogCopy/1: > /data0/somelongdirectorytoavoidrpathissues/src/kudu/src/kudu/integration-tests/cluster_verifier.cc:119: > Failure > Failed > Bad status: Not found: Unable to open table: the table does not exist: > table_name: "table-1" > /data0/somelongdirectorytoavoidrpathissues/src/kudu/src/kudu/master/dynamic_multi_master-test.cc:603: > Failure > Expected: cv.CheckRowCount(table_name, ClusterVerifier::EXACTLY, 0) doesn't > generate new fatal failures in the current thread. > Actual: it does. > 2021-03-17T17:04:19Z chronyd exiting > /data0/somelongdirectorytoavoidrpathissues/src/kudu/src/kudu/master/dynamic_multi_master-test.cc:1099: > Failure > Expected: VerifyClusterAfterMasterAddition(master_hps, orig_num_masters_) > doesn't generate new fatal failures in the current thread. > Actual: it does. > {noformat} > Although the same verification function is used by other tests for add > master, this flakiness started showing up after introduction of the > RecoverDeadMaster test. > https://github.com/apache/kudu/commit/4b4a8c0f2fdfd15524510821b27fc9c3b5d26b6b -- This message was sent by Atlassian Jira (v8.3.4#803005)