[ https://issues.apache.org/jira/browse/SOLR-16416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17608380#comment-17608380 ]
ASF subversion and git services commented on SOLR-16416: -------------------------------------------------------- Commit f529154f74078e9012be486a684bdd021072df72 in solr's branch refs/heads/branch_9x from Houston Putman [ https://gitbox.apache.org/repos/asf?p=solr.git;h=f529154f740 ] SOLR-16416: Add error logging to allow debugging of OverseerNodePrioritizer (cherry picked from commit 0aa9b333d94f7573f83a414586f0ae34fe1c13a5) > Fix silently failing Overseer Election joinAtHead during > testDesignatedOverseerRestarts > --------------------------------------------------------------------------------------- > > Key: SOLR-16416 > URL: https://issues.apache.org/jira/browse/SOLR-16416 > Project: Solr > Issue Type: Bug > Security Level: Public(Default Security Level. Issues are Public) > Reporter: Houston Putman > Priority: Major > > OverseerRolesTest.testDesignatedOverseerRestarts has been failing > consistently (around 2.5% of the time). I think this is because > LeaderElection.joinElection does not respect the joinAtHead flag, if > connectionIssues happen while setting the leader election nodes. > LeaderElection does not use the automatic retryOnConnLoss flags when doing zk > operations. Instead, it waits for an error to come back, and it handles the > retry itself. This is fine for the normal case, because it checks if node is > represented in the leaderElection child nodes, and if so it ignores the > connection loss. However when doing joinAtHead, if the childNode exists, but > isn't at the place it should be, then the manual retry should be exercised. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org For additional commands, e-mail: issues-h...@solr.apache.org