[ 
https://issues.apache.org/jira/browse/KUDU-3266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17307221#comment-17307221
 ] 

ASF subversion and git services commented on KUDU-3266:
-------------------------------------------------------

Commit ac09d6205486908df05af4d64dc7618aae7c37bf in kudu's branch 
refs/heads/master from Bankim Bhavsar
[ https://gitbox.apache.org/repos/asf?p=kudu.git;h=ac09d62 ]

[test] KUDU-3266 Fix flakiness in dynamic_multi_master test

Flakiness was reported in the dynamic_multi_master test after
the introduction of the test for recovering a dead master,
commit 4b4a8c0f2f.

See KUDU-3266 for the analysis.

This change wraps the row count check in ASSERT_EVENTUALLY
so that the resumed master and the remaining master have
a chance to exchange Raft messages and become up to date.
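
The retry-until-success idea behind that wrapper can be sketched in
standalone C++. This is only an illustration and not Kudu's
implementation of ASSERT_EVENTUALLY: EventuallyTrue, LaggingMaster,
DemoLaggingMaster, the backoff values, and the timeout are all
hypothetical names and numbers chosen for the example.

```cpp
#include <algorithm>
#include <chrono>
#include <functional>
#include <thread>

// Hypothetical helper (not Kudu's code): retries `check` with capped
// exponential backoff until it returns true or `timeout` elapses.
bool EventuallyTrue(const std::function<bool()>& check,
                    std::chrono::milliseconds timeout) {
  const auto deadline = std::chrono::steady_clock::now() + timeout;
  std::chrono::milliseconds backoff(1);
  const std::chrono::milliseconds kMaxBackoff(50);
  while (true) {
    if (check()) return true;
    if (std::chrono::steady_clock::now() >= deadline) return false;
    std::this_thread::sleep_for(backoff);
    backoff = std::min(backoff * 2, kMaxBackoff);
  }
}

// Hypothetical stand-in for a master that only sees the table after it
// has caught up; each call to TableVisible() models one poll by the test.
struct LaggingMaster {
  int polls_until_caught_up;
  bool TableVisible() {
    if (polls_until_caught_up > 0) {
      --polls_until_caught_up;
      return false;
    }
    return true;
  }
};

// A one-shot check would fail on the first poll; the retried check
// succeeds once the simulated master has caught up.
bool DemoLaggingMaster() {
  LaggingMaster master{3};
  return EventuallyTrue([&] { return master.TableVisible(); },
                        std::chrono::milliseconds(2000));
}
```

In the sketch, a single call to TableVisible() right after the master
resumes returns false, which mirrors the "table does not exist" failure
in the report, while the retried check passes once the simulated master
is up to date.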

Tests:
- Reproduced the issue on an ASAN build using dist-test.
- Verified no failures over 100 iterations with the fix
on an ASAN build.

Change-Id: Ifac1d95707064b6ac2624d3f52336d6c39afd3c8
Reviewed-on: http://gerrit.cloudera.org:8080/17211
Tested-by: Bankim Bhavsar <ban...@cloudera.com>
Reviewed-by: Andrew Wong <aw...@cloudera.com>
Reviewed-by: Mahesh Reddy <mre...@cloudera.com>
Reviewed-by: Alexey Serbin <aser...@cloudera.com>


> Flakiness in dynamic_multi_master_test in VerifyClusterAfterMasterAddition() 
> function
> -------------------------------------------------------------------------------------
>
>                 Key: KUDU-3266
>                 URL: https://issues.apache.org/jira/browse/KUDU-3266
>             Project: Kudu
>          Issue Type: Test
>          Components: master, test
>    Affects Versions: 1.15.0
>            Reporter: Bankim Bhavsar
>            Assignee: Bankim Bhavsar
>            Priority: Major
>
> {noformat}
> ParameterizedRecoverMasterTest.TestRecoverDeadMasterSysCatalogCopy/1: /data0/somelongdirectorytoavoidrpathissues/src/kudu/src/kudu/integration-tests/cluster_verifier.cc:119: Failure
> Failed
> Bad status: Not found: Unable to open table: the table does not exist: table_name: "table-1"
> /data0/somelongdirectorytoavoidrpathissues/src/kudu/src/kudu/master/dynamic_multi_master-test.cc:603: Failure
> Expected: cv.CheckRowCount(table_name, ClusterVerifier::EXACTLY, 0) doesn't generate new fatal failures in the current thread.
>   Actual: it does.
> 2021-03-17T17:04:19Z chronyd exiting
> /data0/somelongdirectorytoavoidrpathissues/src/kudu/src/kudu/master/dynamic_multi_master-test.cc:1099: Failure
> Expected: VerifyClusterAfterMasterAddition(master_hps, orig_num_masters_) doesn't generate new fatal failures in the current thread.
>   Actual: it does.
> {noformat}
> Although the same verification function is used by other tests that add 
> a master, this flakiness started showing up after the introduction of the 
> RecoverDeadMaster test:
> https://github.com/apache/kudu/commit/4b4a8c0f2fdfd15524510821b27fc9c3b5d26b6b



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
