[ 
https://issues.apache.org/jira/browse/SOLR-12412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16545876#comment-16545876
 ] 

Varun Thacker commented on SOLR-12412:
--------------------------------------

With regards to the actual failure , I think we are shutting down the wrong 
Jetty?

 

>From the seed we have numReplicas=2.  Which means we want to shutdown the 
>non-leader shard but from the logs it's shutting down the leader jetty? 

And then when we go to corrupt the leader jetty , it's actually closed ?
{code:java}
[junit4] 2> 13526 INFO 
(TEST-LeaderTragicEventTest.testOtherReplicasAreNotActive-seed#[7146D51E1F1D9F1A])
 [ ] o.a.s.c.ZkController Remove node as live in 
ZooKeeper:/live_nodes/127.0.0.1:35477_solr
[junit4] 2> 13526 INFO 
(TEST-LeaderTragicEventTest.testOtherReplicasAreNotActive-seed#[7146D51E1F1D9F1A])
 [ ] o.a.s.m.SolrMetricManager Closing metric reporters for 
registry=solr.cluster, tag=null
[junit4] 2> 13526 INFO (zkCallback-17-thread-1) [ ] o.a.s.c.c.ZkStateReader 
Updated live nodes from ZooKeeper... (2) -> (1)
....

[junit4] 2> 13543 INFO (coreCloseExecutor-33-thread-1) [n:127.0.0.1:35477_solr 
c:collection1 s:shard1 r:core_node3 x:collection1_shard1_replica_n1] 
o.a.s.m.SolrMetricManager Closing metric reporters for 
registry=solr.collection.collection1.shard1.leader, tag=f37433
...
[junit4] 2> 13554 INFO 
(OverseerStateUpdate-72132540686336006-127.0.0.1:35477_solr-n_0000000000) [ ] 
o.a.s.c.Overseer Overseer Loop exiting : 127.0.0.1:35477_solr
[junit4] 2> 13554 WARN 
(OverseerAutoScalingTriggerThread-72132540686336006-127.0.0.1:35477_solr-n_0000000000)
 [ ] o.a.s.c.a.OverseerTriggerThread OverseerTriggerThread woken up but we are 
closed, exiting.
[junit4] 2> 13562 INFO (zkCallback-17-thread-1) [ ] 
o.a.s.c.OverseerElectionContext I am going to be the leader 127.0.0.1:36827_solr
[junit4] 2> 13562 INFO (zkCallback-17-thread-1) [ ] o.a.s.c.Overseer Overseer 
(id=72132540686336005-127.0.0.1:36827_solr-n_0000000001) starting
...
[junit4] 2> 13575 INFO 
(TEST-LeaderTragicEventTest.testOtherReplicasAreNotActive-seed#[7146D51E1F1D9F1A])
 [ ] o.a.s.SolrTestCaseJ4 ###Ending testOtherReplicasAreNotActive
[junit4] 2> NOTE: reproduce with: ant test -Dtestcase=LeaderTragicEventTest 
-Dtests.method=testOtherReplicasAreNotActive -Dtests.seed=7146D51E1F1D9F1A 
-Dtests.multiplier=3 -Dtests.slow=true -Dtests.badapples=true 
-Dtests.locale=es-CL -Dtests.timezone=Pacific/Niue -Dtests.asserts=true 
-Dtests.file.encoding=ISO-8859-1
[junit4] ERROR 5.96s J2 | LeaderTragicEventTest.testOtherReplicasAreNotActive 
<<<
[junit4] > Throwable #1: java.lang.IllegalStateException: Jetty Connector is 
not open: -2
[junit4] >    at 
__randomizedtesting.SeedInfo.seed([7146D51E1F1D9F1A:F4F2F96923E22682]:0)
[junit4] >    at 
org.apache.solr.client.solrj.embedded.JettySolrRunner.getBaseUrl(JettySolrRunner.java:499)
[junit4] >    at 
org.apache.solr.cloud.MiniSolrCloudCluster.getReplicaJetty(MiniSolrCloudCluster.java:539)
[junit4] >    at 
org.apache.solr.cloud.LeaderTragicEventTest.corruptLeader(LeaderTragicEventTest.java:100)
[junit4] >    at 
org.apache.solr.cloud.LeaderTragicEventTest.testOtherReplicasAreNotActive(LeaderTragicEventTest.java:150)
[junit4] >    at java.lang.Thread.run(Thread.java:748){code}
 

> Leader should give up leadership when IndexWriter.tragedy occur
> ---------------------------------------------------------------
>
>                 Key: SOLR-12412
>                 URL: https://issues.apache.org/jira/browse/SOLR-12412
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Cao Manh Dat
>            Assignee: Cao Manh Dat
>            Priority: Major
>         Attachments: SOLR-12412.patch, SOLR-12412.patch, 
> jenkins-failure-2325.log
>
>
> When a leader meets some kind of unrecoverable exception (ie: 
> CorruptedIndexException). The shard will go into the readable state and human 
> has to intervene. In that case, it will be the best if the leader gives up 
> its leadership and let other replicas become the leader. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to