[
https://issues.apache.org/jira/browse/SOLR-7736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16421394#comment-16421394
]
Mikhail Khludnev commented on SOLR-7736:
----------------------------------------
the problem looks as follows: we have OverseerAutoScalingTriggerThread
[spining|https://github.com/apache/lucene-solr/blob/master/solr/core/src/java/org/apache/solr/cloud/autoscaling/OverseerTriggerThread.java#L142]
in the same error
{code}
[junit4] 2> 2650914 ERROR
(OverseerAutoScalingTriggerThread-72373776421355522-127.0.0.1:8983_solr-n_0000000001)
[ ] o.a.s.c.a.OverseerTriggerThread A ZK error has occurred
[junit4] 2> java.io.IOException:
org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode =
Session expired for /autoscaling.json
[junit4] 2> at
org.apache.solr.client.solrj.impl.ZkDistribStateManager.getAutoScalingConfig(ZkDistribStateManager.java:183)
~[java/:?]
[junit4] 2> at
org.apache.solr.client.solrj.cloud.DistribStateManager.getAutoScalingConfig(DistribStateManager.java:83)
~[java/:?]
[junit4] 2> at
org.apache.solr.cloud.autoscaling.OverseerTriggerThread.run(OverseerTriggerThread.java:127)
[java/:?]
[junit4] 2> at java.lang.Thread.run(Thread.java:748) [?:1.8.0_152]
[junit4] 2> Caused by:
org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode =
Session expired for /autoscaling.json
[junit4] 2> at
org.apache.zookeeper.KeeperException.create(KeeperException.java:130)
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]
[junit4] 2> at
org.apache.zookeeper.KeeperException.create(KeeperException.java:54)
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]
[junit4] 2> at
org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:1215)
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]
[junit4] 2> at
org.apache.solr.common.cloud.SolrZkClient.lambda$getData$5(SolrZkClient.java:340)
~[java/:?]
[junit4] 2> at
org.apache.solr.common.cloud.ZkCmdExecutor.retryOperation(ZkCmdExecutor.java:60)
~[java/:?]
[junit4] 2> at
org.apache.solr.common.cloud.SolrZkClient.getData(SolrZkClient.java:340)
~[java/:?]
[junit4] 2> at
org.apache.solr.client.solrj.impl.ZkDistribStateManager.getAutoScalingConfig(ZkDistribStateManager.java:176)
~[java/:?]
[junit4] 2> ... 3 more
{code}
That Zk session expiration is a separate question. As I remember in some places
a session is restored, there are a lot of methods where SessionExpiredException
fallen through, and I don't fully understand why. There a other questions about
root case. I want to tackle just interruption problem so far. It seems like
this spin can not be interrupted by the thread lingering.
> Add a test for ZkController.publishAndWaitForDownStates
> -------------------------------------------------------
>
> Key: SOLR-7736
> URL: https://issues.apache.org/jira/browse/SOLR-7736
> Project: Solr
> Issue Type: Test
> Components: SolrCloud, Tests
> Reporter: Shalin Shekhar Mangar
> Assignee: Shalin Shekhar Mangar
> Priority: Minor
> Fix For: 7.4, master (8.0)
>
> Attachments: SOLR-7736.patch, consoleFull-2462-ZkControllerTest.txt.gz
>
>
> Add a test for ZkController.publishAndWaitForDownStates so that bugs like
> SOLR-6665 do not occur again. A test exists but it is not correct and
> currently disabled via AwaitsFix.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]