[ 
https://issues.apache.org/jira/browse/SOLR-8724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15170245#comment-15170245
 ] 

Steve Rowe commented on SOLR-8724:
----------------------------------

I'm getting the same errors when running the older patch too; previous success 
with the older patch was on my laptop, but seeing consistent failures on my 
server with more test concurrency.  So the errors are unrelated to the 
parseProperties() implementation.

Here's one (all 5 suites that fail have this pattern: in the process of a 
ZkTestServer starting up, it fails to register the connection in the platform 
MBean server):

{noformat}
   [junit4] Suite: org.apache.solr.handler.TestSolrConfigHandlerConcurrent
   [junit4]   2> Creating dataDir: 
/home/sarowe/git/lucene-solr/solr/build/solr-core/test/J12/temp/solr.handler.TestSolrConfigHandlerConcurrent_ACC627484AE10FE0-001/init-core-data-001
   [junit4]   2> 0    INFO  
(SUITE-TestSolrConfigHandlerConcurrent-seed#[ACC627484AE10FE0]-worker) [    ] 
o.a.s.SolrTestCaseJ4 Randomized ssl (false) and clientAuth (false)
   [junit4]   2> 37   INFO  
(SUITE-TestSolrConfigHandlerConcurrent-seed#[ACC627484AE10FE0]-worker) [    ] 
o.a.s.BaseDistributedSearchTestCase Setting hostContext system property: /
   [junit4]   2> 107  INFO  
(TEST-TestSolrConfigHandlerConcurrent.test-seed#[ACC627484AE10FE0]) [    ] 
o.a.s.c.ZkTestServer STARTING ZK TEST SERVER
   [junit4]   2> 117  INFO  (Thread-1) [    ] o.a.s.c.ZkTestServer client 
port:0.0.0.0/0.0.0.0:0
   [junit4]   2> 117  INFO  (Thread-1) [    ] o.a.s.c.ZkTestServer Starting 
server
   [junit4]   2> 215  INFO  
(TEST-TestSolrConfigHandlerConcurrent.test-seed#[ACC627484AE10FE0]) [    ] 
o.a.s.c.ZkTestServer start zk server on port:34020
   [junit4]   2> 243  INFO  
(TEST-TestSolrConfigHandlerConcurrent.test-seed#[ACC627484AE10FE0]) [    ] 
o.a.s.c.c.SolrZkClient Using default ZkCredentialsProvider
   [junit4]   2> 298  INFO  
(TEST-TestSolrConfigHandlerConcurrent.test-seed#[ACC627484AE10FE0]) [    ] 
o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper
   [junit4]   2> 334  ERROR (SyncThread:0) [    ] 
o.a.z.s.ZooKeeperCriticalThread Severe unrecoverable error, from thread : 
SyncThread:0
   [junit4]   2> java.lang.AssertionError
   [junit4]   2>        at 
org.apache.zookeeper.jmx.MBeanRegistry.register(MBeanRegistry.java:93)
   [junit4]   2>        at 
org.apache.zookeeper.server.ServerCnxnFactory.registerConnection(ServerCnxnFactory.java:147)
   [junit4]   2>        at 
org.apache.zookeeper.server.ZooKeeperServer.finishSessionInit(ZooKeeperServer.java:613)
   [junit4]   2>        at 
org.apache.zookeeper.server.FinalRequestProcessor.processRequest(FinalRequestProcessor.java:181)
   [junit4]   2>        at 
org.apache.zookeeper.server.SyncRequestProcessor.flush(SyncRequestProcessor.java:200)
   [junit4]   2>        at 
org.apache.zookeeper.server.SyncRequestProcessor.run(SyncRequestProcessor.java:131)
   [junit4]   2> 45314 WARN  
(TEST-TestSolrConfigHandlerConcurrent.test-seed#[ACC627484AE10FE0]-SendThread(127.0.0.1:34020))
 [    ] o.a.z.ClientCnxn Client session timed out, have not heard from server 
in 45007ms for sessionid 0x0
   [junit4]   2> NOTE: download the large Jenkins line-docs file by running 
'ant get-jenkins-line-docs' in the lucene directory.
   [junit4]   2> NOTE: reproduce with: ant test  
-Dtestcase=TestSolrConfigHandlerConcurrent -Dtests.method=test 
-Dtests.seed=ACC627484AE10FE0 -Dtests.slow=true 
-Dtests.linedocsfile=/home/jenkins/lucene-data/enwiki.random.lines.txt 
-Dtests.locale=de-LU -Dtests.timezone=Cuba -Dtests.asserts=true 
-Dtests.file.encoding=UTF-8
   [junit4] ERROR   45.4s J12 | TestSolrConfigHandlerConcurrent.test <<<
   [junit4]    > Throwable #1: org.apache.solr.common.SolrException: 
java.util.concurrent.TimeoutException: Could not connect to ZooKeeper 
127.0.0.1:34020 within 45000 ms
   [junit4]    >        at 
__randomizedtesting.SeedInfo.seed([ACC627484AE10FE0:24921892E41D6218]:0)
   [junit4]    >        at 
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:181)
   [junit4]    >        at 
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:115)
   [junit4]    >        at 
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:105)
   [junit4]    >        at 
org.apache.solr.cloud.AbstractZkTestCase.buildZooKeeper(AbstractZkTestCase.java:88)
   [junit4]    >        at 
org.apache.solr.cloud.AbstractZkTestCase.buildZooKeeper(AbstractZkTestCase.java:82)
   [junit4]    >        at 
org.apache.solr.cloud.AbstractDistribZkTestBase.distribSetUp(AbstractDistribZkTestBase.java:75)
   [junit4]    >        at 
org.apache.solr.cloud.AbstractFullDistribZkTestBase.distribSetUp(AbstractFullDistribZkTestBase.java:217)
   [junit4]    >        at 
org.apache.solr.BaseDistributedSearchTestCase$ShardsRepeatRule$ShardsStatement.evaluate(BaseDistributedSearchTestCase.java:966)
   [junit4]    >        at java.lang.Thread.run(Thread.java:745)
   [junit4]    > Caused by: java.util.concurrent.TimeoutException: Could not 
connect to ZooKeeper 127.0.0.1:34020 within 45000 ms
   [junit4]    >        at 
org.apache.solr.common.cloud.ConnectionManager.waitForConnected(ConnectionManager.java:229)
   [junit4]    >        at 
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:173)
   [junit4]    >        ... 38 more
   [junit4]   2> 45430 INFO  
(SUITE-TestSolrConfigHandlerConcurrent-seed#[ACC627484AE10FE0]-worker) [    ] 
o.a.s.SolrTestCaseJ4 ###deleteCore
   [junit4]   2> NOTE: leaving temporary files on disk at: 
/home/sarowe/git/lucene-solr/solr/build/solr-core/test/J12/temp/solr.handler.TestSolrConfigHandlerConcurrent_ACC627484AE10FE0-001
   [junit4]   2> Feb 26, 2016 8:03:44 PM 
com.carrotsearch.randomizedtesting.ThreadLeakControl checkThreadLeaks
   [junit4]   2> WARNING: Will linger awaiting termination of 3 leaked 
thread(s).
   [junit4]   2> Feb 26, 2016 8:04:04 PM 
com.carrotsearch.randomizedtesting.ThreadLeakControl checkThreadLeaks
   [junit4]   2> SEVERE: 2 threads leaked from SUITE scope at 
org.apache.solr.handler.TestSolrConfigHandlerConcurrent: 
   [junit4]   2>    1) Thread[id=22, name=Thread-1, state=WAITING, 
group=TGRP-TestSolrConfigHandlerConcurrent]
   [junit4]   2>         at java.lang.Object.wait(Native Method)
   [junit4]   2>         at java.lang.Thread.join(Thread.java:1245)
   [junit4]   2>         at java.lang.Thread.join(Thread.java:1319)
   [junit4]   2>         at 
org.apache.zookeeper.server.NIOServerCnxnFactory.join(NIOServerCnxnFactory.java:297)
   [junit4]   2>         at 
org.apache.solr.cloud.ZkTestServer$ZKServerMain.runFromConfig(ZkTestServer.java:310)
   [junit4]   2>         at 
org.apache.solr.cloud.ZkTestServer$2.run(ZkTestServer.java:491)
   [junit4]   2>    2) Thread[id=23, 
name=NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0, state=RUNNABLE, 
group=TGRP-TestSolrConfigHandlerConcurrent]
   [junit4]   2>         at sun.nio.ch.EPollArrayWrapper.epollWait(Native 
Method)
   [junit4]   2>         at 
sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:269)
   [junit4]   2>         at 
sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:79)
   [junit4]   2>         at 
sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:86)
   [junit4]   2>         at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:97)
   [junit4]   2>         at 
org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:173)
   [junit4]   2>         at java.lang.Thread.run(Thread.java:745)
   [junit4]   2> Feb 26, 2016 8:04:04 PM 
com.carrotsearch.randomizedtesting.ThreadLeakControl tryToInterruptAll
   [junit4]   2> INFO: Starting to interrupt leaked threads:
   [junit4]   2>    1) Thread[id=22, name=Thread-1, state=WAITING, 
group=TGRP-TestSolrConfigHandlerConcurrent]
   [junit4]   2>    2) Thread[id=23, 
name=NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0, state=RUNNABLE, 
group=TGRP-TestSolrConfigHandlerConcurrent]
   [junit4]   2> 65522 WARN  (Thread-1) [    ] o.a.s.c.ZkTestServer Server 
interrupted
   [junit4]   2> java.lang.InterruptedException
   [junit4]   2>        at java.lang.Object.wait(Native Method)
   [junit4]   2>        at java.lang.Thread.join(Thread.java:1245)
   [junit4]   2>        at java.lang.Thread.join(Thread.java:1319)
   [junit4]   2>        at 
org.apache.zookeeper.server.NIOServerCnxnFactory.join(NIOServerCnxnFactory.java:297)
   [junit4]   2>        at 
org.apache.solr.cloud.ZkTestServer$ZKServerMain.runFromConfig(ZkTestServer.java:310)
   [junit4]   2>        at 
org.apache.solr.cloud.ZkTestServer$2.run(ZkTestServer.java:491)
  [junit4]   2> Feb 26, 2016 8:04:07 PM 
com.carrotsearch.randomizedtesting.ThreadLeakControl tryToInterruptAll
   [junit4]   2> SEVERE: There are still zombie threads that couldn't be 
terminated:
   [junit4]   2>    1) Thread[id=23, 
name=NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0, state=RUNNABLE, 
group=TGRP-TestSolrConfigHandlerConcurrent]
   [junit4]   2>         at sun.nio.ch.EPollArrayWrapper.interrupt(Native 
Method)
   [junit4]   2>         at 
sun.nio.ch.EPollArrayWrapper.interrupt(EPollArrayWrapper.java:317)
   [junit4]   2>         at 
sun.nio.ch.EPollSelectorImpl.wakeup(EPollSelectorImpl.java:193)
   [junit4]   2>         at 
java.nio.channels.spi.AbstractSelector$1.interrupt(AbstractSelector.java:213)
   [junit4]   2>         at 
java.nio.channels.spi.AbstractSelector.begin(AbstractSelector.java:219)
   [junit4]   2>         at 
sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:78)
   [junit4]   2>         at 
sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:86)
   [junit4]   2>         at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:97)
   [junit4]   2>         at 
org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:173)
   [junit4]   2>         at java.lang.Thread.run(Thread.java:745)
   [junit4]   2> NOTE: test params are: 
codec=FastCompressingStoredFields(storedFieldsFormat=CompressingStoredFieldsFormat(compressionMode=FAST,
 chunkSize=14609, maxDocsPerChunk=491, blockSize=3), 
termVectorsFormat=CompressingTermVectorsFormat(compressionMode=FAST, 
chunkSize=14609, blockSize=3)), sim=ClassicSimilarity, locale=de-LU, 
timezone=Cuba
   [junit4]   2> NOTE: Linux 4.1.0-custom2-amd64 amd64/Oracle Corporation 
1.8.0_45 (64-bit)/cpus=16,threads=3,free=400229776,total=514850816
   [junit4]   2> NOTE: All tests run in this JVM: 
[TestSolrConfigHandlerConcurrent]
   [junit4]   2> NOTE: download the large Jenkins line-docs file by running 
'ant get-jenkins-line-docs' in the lucene directory.
   [junit4]   2> NOTE: reproduce with: ant test  
-Dtestcase=TestSolrConfigHandlerConcurrent -Dtests.seed=ACC627484AE10FE0 
-Dtests.slow=true 
-Dtests.linedocsfile=/home/jenkins/lucene-data/enwiki.random.lines.txt 
-Dtests.locale=de-LU -Dtests.timezone=Cuba -Dtests.asserts=true 
-Dtests.file.encoding=UTF-8
   [junit4] ERROR   0.00s J12 | TestSolrConfigHandlerConcurrent (suite) <<<
   [junit4]    > Throwable #1: 
com.carrotsearch.randomizedtesting.ThreadLeakError: 2 threads leaked from SUITE 
scope at org.apache.solr.handler.TestSolrConfigHandlerConcurrent: 
   [junit4]    >    1) Thread[id=22, name=Thread-1, state=WAITING, 
group=TGRP-TestSolrConfigHandlerConcurrent]
   [junit4]    >         at java.lang.Object.wait(Native Method)
   [junit4]    >         at java.lang.Thread.join(Thread.java:1245)
   [junit4]    >         at java.lang.Thread.join(Thread.java:1319)
   [junit4]    >         at 
org.apache.zookeeper.server.NIOServerCnxnFactory.join(NIOServerCnxnFactory.java:297)
   [junit4]    >         at 
org.apache.solr.cloud.ZkTestServer$ZKServerMain.runFromConfig(ZkTestServer.java:310)
   [junit4]    >         at 
org.apache.solr.cloud.ZkTestServer$2.run(ZkTestServer.java:491)
   [junit4]    >    2) Thread[id=23, 
name=NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0, state=RUNNABLE, 
group=TGRP-TestSolrConfigHandlerConcurrent]
   [junit4]    >         at sun.nio.ch.EPollArrayWrapper.epollWait(Native 
Method)
   [junit4]    >         at 
sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:269)
   [junit4]    >         at 
sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:79)
   [junit4]    >         at 
sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:86)
   [junit4]    >         at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:97)
   [junit4]    >         at 
org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:173)
   [junit4]    >         at java.lang.Thread.run(Thread.java:745)
   [junit4]    >        at 
__randomizedtesting.SeedInfo.seed([ACC627484AE10FE0]:0)Throwable #2: 
com.carrotsearch.randomizedtesting.ThreadLeakError: There are still zombie 
threads that couldn't be terminated:
   [junit4]    >    1) Thread[id=23, 
name=NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0, state=RUNNABLE, 
group=TGRP-TestSolrConfigHandlerConcurrent]
   [junit4]    >         at sun.nio.ch.EPollArrayWrapper.interrupt(Native 
Method)
   [junit4]    >         at 
sun.nio.ch.EPollArrayWrapper.interrupt(EPollArrayWrapper.java:317)
   [junit4]    >         at 
sun.nio.ch.EPollSelectorImpl.wakeup(EPollSelectorImpl.java:193)
   [junit4]    >         at 
java.nio.channels.spi.AbstractSelector$1.interrupt(AbstractSelector.java:213)
   [junit4]    >         at 
java.nio.channels.spi.AbstractSelector.begin(AbstractSelector.java:219)
   [junit4]    >         at 
sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:78)
   [junit4]    >         at 
sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:86)
   [junit4]    >         at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:97)
   [junit4]    >         at 
org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:173)
   [junit4]    >         at java.lang.Thread.run(Thread.java:745)
   [junit4]    >        at 
__randomizedtesting.SeedInfo.seed([ACC627484AE10FE0]:0)
   [junit4] Completed [100/580 (1!)] on J12 in 70.76s, 1 test, 3 errors <<< 
FAILURES!
{noformat}

> Upgrade Zookeeper to 3.4.8
> --------------------------
>
>                 Key: SOLR-8724
>                 URL: https://issues.apache.org/jira/browse/SOLR-8724
>             Project: Solr
>          Issue Type: Bug
>            Reporter: Steve Rowe
>             Fix For: master
>
>         Attachments: SOLR-8724.patch, SOLR-8724.patch, SOLR-8724.patch
>
>
> Zookeeper 3.4.8 was released a few days ago - we should upgrade.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to