[
https://issues.apache.org/jira/browse/SOLR-8724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15170245#comment-15170245
]
Steve Rowe commented on SOLR-8724:
----------------------------------
I get the same errors when running the older patch too. The previous success
with the older patch was on my laptop, but I'm seeing consistent failures on
my server, which runs tests with more concurrency. So the errors are unrelated
to the parseProperties() implementation.
Here's one. All 5 failing suites show the same pattern: while a ZkTestServer
is starting up, it fails to register the connection in the platform MBean
server:
{noformat}
[junit4] Suite: org.apache.solr.handler.TestSolrConfigHandlerConcurrent
[junit4] 2> Creating dataDir:
/home/sarowe/git/lucene-solr/solr/build/solr-core/test/J12/temp/solr.handler.TestSolrConfigHandlerConcurrent_ACC627484AE10FE0-001/init-core-data-001
[junit4] 2> 0 INFO
(SUITE-TestSolrConfigHandlerConcurrent-seed#[ACC627484AE10FE0]-worker) [ ]
o.a.s.SolrTestCaseJ4 Randomized ssl (false) and clientAuth (false)
[junit4] 2> 37 INFO
(SUITE-TestSolrConfigHandlerConcurrent-seed#[ACC627484AE10FE0]-worker) [ ]
o.a.s.BaseDistributedSearchTestCase Setting hostContext system property: /
[junit4] 2> 107 INFO
(TEST-TestSolrConfigHandlerConcurrent.test-seed#[ACC627484AE10FE0]) [ ]
o.a.s.c.ZkTestServer STARTING ZK TEST SERVER
[junit4] 2> 117 INFO (Thread-1) [ ] o.a.s.c.ZkTestServer client
port:0.0.0.0/0.0.0.0:0
[junit4] 2> 117 INFO (Thread-1) [ ] o.a.s.c.ZkTestServer Starting
server
[junit4] 2> 215 INFO
(TEST-TestSolrConfigHandlerConcurrent.test-seed#[ACC627484AE10FE0]) [ ]
o.a.s.c.ZkTestServer start zk server on port:34020
[junit4] 2> 243 INFO
(TEST-TestSolrConfigHandlerConcurrent.test-seed#[ACC627484AE10FE0]) [ ]
o.a.s.c.c.SolrZkClient Using default ZkCredentialsProvider
[junit4] 2> 298 INFO
(TEST-TestSolrConfigHandlerConcurrent.test-seed#[ACC627484AE10FE0]) [ ]
o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper
[junit4] 2> 334 ERROR (SyncThread:0) [ ]
o.a.z.s.ZooKeeperCriticalThread Severe unrecoverable error, from thread :
SyncThread:0
[junit4] 2> java.lang.AssertionError
[junit4] 2> at
org.apache.zookeeper.jmx.MBeanRegistry.register(MBeanRegistry.java:93)
[junit4] 2> at
org.apache.zookeeper.server.ServerCnxnFactory.registerConnection(ServerCnxnFactory.java:147)
[junit4] 2> at
org.apache.zookeeper.server.ZooKeeperServer.finishSessionInit(ZooKeeperServer.java:613)
[junit4] 2> at
org.apache.zookeeper.server.FinalRequestProcessor.processRequest(FinalRequestProcessor.java:181)
[junit4] 2> at
org.apache.zookeeper.server.SyncRequestProcessor.flush(SyncRequestProcessor.java:200)
[junit4] 2> at
org.apache.zookeeper.server.SyncRequestProcessor.run(SyncRequestProcessor.java:131)
[junit4] 2> 45314 WARN
(TEST-TestSolrConfigHandlerConcurrent.test-seed#[ACC627484AE10FE0]-SendThread(127.0.0.1:34020))
[ ] o.a.z.ClientCnxn Client session timed out, have not heard from server
in 45007ms for sessionid 0x0
[junit4] 2> NOTE: download the large Jenkins line-docs file by running
'ant get-jenkins-line-docs' in the lucene directory.
[junit4] 2> NOTE: reproduce with: ant test
-Dtestcase=TestSolrConfigHandlerConcurrent -Dtests.method=test
-Dtests.seed=ACC627484AE10FE0 -Dtests.slow=true
-Dtests.linedocsfile=/home/jenkins/lucene-data/enwiki.random.lines.txt
-Dtests.locale=de-LU -Dtests.timezone=Cuba -Dtests.asserts=true
-Dtests.file.encoding=UTF-8
[junit4] ERROR 45.4s J12 | TestSolrConfigHandlerConcurrent.test <<<
[junit4] > Throwable #1: org.apache.solr.common.SolrException:
java.util.concurrent.TimeoutException: Could not connect to ZooKeeper
127.0.0.1:34020 within 45000 ms
[junit4] > at
__randomizedtesting.SeedInfo.seed([ACC627484AE10FE0:24921892E41D6218]:0)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:181)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:115)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:105)
[junit4] > at
org.apache.solr.cloud.AbstractZkTestCase.buildZooKeeper(AbstractZkTestCase.java:88)
[junit4] > at
org.apache.solr.cloud.AbstractZkTestCase.buildZooKeeper(AbstractZkTestCase.java:82)
[junit4] > at
org.apache.solr.cloud.AbstractDistribZkTestBase.distribSetUp(AbstractDistribZkTestBase.java:75)
[junit4] > at
org.apache.solr.cloud.AbstractFullDistribZkTestBase.distribSetUp(AbstractFullDistribZkTestBase.java:217)
[junit4] > at
org.apache.solr.BaseDistributedSearchTestCase$ShardsRepeatRule$ShardsStatement.evaluate(BaseDistributedSearchTestCase.java:966)
[junit4] > at java.lang.Thread.run(Thread.java:745)
[junit4] > Caused by: java.util.concurrent.TimeoutException: Could not
connect to ZooKeeper 127.0.0.1:34020 within 45000 ms
[junit4] > at
org.apache.solr.common.cloud.ConnectionManager.waitForConnected(ConnectionManager.java:229)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:173)
[junit4] > ... 38 more
[junit4] 2> 45430 INFO
(SUITE-TestSolrConfigHandlerConcurrent-seed#[ACC627484AE10FE0]-worker) [ ]
o.a.s.SolrTestCaseJ4 ###deleteCore
[junit4] 2> NOTE: leaving temporary files on disk at:
/home/sarowe/git/lucene-solr/solr/build/solr-core/test/J12/temp/solr.handler.TestSolrConfigHandlerConcurrent_ACC627484AE10FE0-001
[junit4] 2> Feb 26, 2016 8:03:44 PM
com.carrotsearch.randomizedtesting.ThreadLeakControl checkThreadLeaks
[junit4] 2> WARNING: Will linger awaiting termination of 3 leaked
thread(s).
[junit4] 2> Feb 26, 2016 8:04:04 PM
com.carrotsearch.randomizedtesting.ThreadLeakControl checkThreadLeaks
[junit4] 2> SEVERE: 2 threads leaked from SUITE scope at
org.apache.solr.handler.TestSolrConfigHandlerConcurrent:
[junit4] 2> 1) Thread[id=22, name=Thread-1, state=WAITING,
group=TGRP-TestSolrConfigHandlerConcurrent]
[junit4] 2> at java.lang.Object.wait(Native Method)
[junit4] 2> at java.lang.Thread.join(Thread.java:1245)
[junit4] 2> at java.lang.Thread.join(Thread.java:1319)
[junit4] 2> at
org.apache.zookeeper.server.NIOServerCnxnFactory.join(NIOServerCnxnFactory.java:297)
[junit4] 2> at
org.apache.solr.cloud.ZkTestServer$ZKServerMain.runFromConfig(ZkTestServer.java:310)
[junit4] 2> at
org.apache.solr.cloud.ZkTestServer$2.run(ZkTestServer.java:491)
[junit4] 2> 2) Thread[id=23,
name=NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0, state=RUNNABLE,
group=TGRP-TestSolrConfigHandlerConcurrent]
[junit4] 2> at sun.nio.ch.EPollArrayWrapper.epollWait(Native
Method)
[junit4] 2> at
sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:269)
[junit4] 2> at
sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:79)
[junit4] 2> at
sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:86)
[junit4] 2> at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:97)
[junit4] 2> at
org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:173)
[junit4] 2> at java.lang.Thread.run(Thread.java:745)
[junit4] 2> Feb 26, 2016 8:04:04 PM
com.carrotsearch.randomizedtesting.ThreadLeakControl tryToInterruptAll
[junit4] 2> INFO: Starting to interrupt leaked threads:
[junit4] 2> 1) Thread[id=22, name=Thread-1, state=WAITING,
group=TGRP-TestSolrConfigHandlerConcurrent]
[junit4] 2> 2) Thread[id=23,
name=NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0, state=RUNNABLE,
group=TGRP-TestSolrConfigHandlerConcurrent]
[junit4] 2> 65522 WARN (Thread-1) [ ] o.a.s.c.ZkTestServer Server
interrupted
[junit4] 2> java.lang.InterruptedException
[junit4] 2> at java.lang.Object.wait(Native Method)
[junit4] 2> at java.lang.Thread.join(Thread.java:1245)
[junit4] 2> at java.lang.Thread.join(Thread.java:1319)
[junit4] 2> at
org.apache.zookeeper.server.NIOServerCnxnFactory.join(NIOServerCnxnFactory.java:297)
[junit4] 2> at
org.apache.solr.cloud.ZkTestServer$ZKServerMain.runFromConfig(ZkTestServer.java:310)
[junit4] 2> at
org.apache.solr.cloud.ZkTestServer$2.run(ZkTestServer.java:491)
[junit4] 2> Feb 26, 2016 8:04:07 PM
com.carrotsearch.randomizedtesting.ThreadLeakControl tryToInterruptAll
[junit4] 2> SEVERE: There are still zombie threads that couldn't be
terminated:
[junit4] 2> 1) Thread[id=23,
name=NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0, state=RUNNABLE,
group=TGRP-TestSolrConfigHandlerConcurrent]
[junit4] 2> at sun.nio.ch.EPollArrayWrapper.interrupt(Native
Method)
[junit4] 2> at
sun.nio.ch.EPollArrayWrapper.interrupt(EPollArrayWrapper.java:317)
[junit4] 2> at
sun.nio.ch.EPollSelectorImpl.wakeup(EPollSelectorImpl.java:193)
[junit4] 2> at
java.nio.channels.spi.AbstractSelector$1.interrupt(AbstractSelector.java:213)
[junit4] 2> at
java.nio.channels.spi.AbstractSelector.begin(AbstractSelector.java:219)
[junit4] 2> at
sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:78)
[junit4] 2> at
sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:86)
[junit4] 2> at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:97)
[junit4] 2> at
org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:173)
[junit4] 2> at java.lang.Thread.run(Thread.java:745)
[junit4] 2> NOTE: test params are:
codec=FastCompressingStoredFields(storedFieldsFormat=CompressingStoredFieldsFormat(compressionMode=FAST,
chunkSize=14609, maxDocsPerChunk=491, blockSize=3),
termVectorsFormat=CompressingTermVectorsFormat(compressionMode=FAST,
chunkSize=14609, blockSize=3)), sim=ClassicSimilarity, locale=de-LU,
timezone=Cuba
[junit4] 2> NOTE: Linux 4.1.0-custom2-amd64 amd64/Oracle Corporation
1.8.0_45 (64-bit)/cpus=16,threads=3,free=400229776,total=514850816
[junit4] 2> NOTE: All tests run in this JVM:
[TestSolrConfigHandlerConcurrent]
[junit4] 2> NOTE: download the large Jenkins line-docs file by running
'ant get-jenkins-line-docs' in the lucene directory.
[junit4] 2> NOTE: reproduce with: ant test
-Dtestcase=TestSolrConfigHandlerConcurrent -Dtests.seed=ACC627484AE10FE0
-Dtests.slow=true
-Dtests.linedocsfile=/home/jenkins/lucene-data/enwiki.random.lines.txt
-Dtests.locale=de-LU -Dtests.timezone=Cuba -Dtests.asserts=true
-Dtests.file.encoding=UTF-8
[junit4] ERROR 0.00s J12 | TestSolrConfigHandlerConcurrent (suite) <<<
[junit4] > Throwable #1:
com.carrotsearch.randomizedtesting.ThreadLeakError: 2 threads leaked from SUITE
scope at org.apache.solr.handler.TestSolrConfigHandlerConcurrent:
[junit4] > 1) Thread[id=22, name=Thread-1, state=WAITING,
group=TGRP-TestSolrConfigHandlerConcurrent]
[junit4] > at java.lang.Object.wait(Native Method)
[junit4] > at java.lang.Thread.join(Thread.java:1245)
[junit4] > at java.lang.Thread.join(Thread.java:1319)
[junit4] > at
org.apache.zookeeper.server.NIOServerCnxnFactory.join(NIOServerCnxnFactory.java:297)
[junit4] > at
org.apache.solr.cloud.ZkTestServer$ZKServerMain.runFromConfig(ZkTestServer.java:310)
[junit4] > at
org.apache.solr.cloud.ZkTestServer$2.run(ZkTestServer.java:491)
[junit4] > 2) Thread[id=23,
name=NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0, state=RUNNABLE,
group=TGRP-TestSolrConfigHandlerConcurrent]
[junit4] > at sun.nio.ch.EPollArrayWrapper.epollWait(Native
Method)
[junit4] > at
sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:269)
[junit4] > at
sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:79)
[junit4] > at
sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:86)
[junit4] > at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:97)
[junit4] > at
org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:173)
[junit4] > at java.lang.Thread.run(Thread.java:745)
[junit4] > at
__randomizedtesting.SeedInfo.seed([ACC627484AE10FE0]:0)Throwable #2:
com.carrotsearch.randomizedtesting.ThreadLeakError: There are still zombie
threads that couldn't be terminated:
[junit4] > 1) Thread[id=23,
name=NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0, state=RUNNABLE,
group=TGRP-TestSolrConfigHandlerConcurrent]
[junit4] > at sun.nio.ch.EPollArrayWrapper.interrupt(Native
Method)
[junit4] > at
sun.nio.ch.EPollArrayWrapper.interrupt(EPollArrayWrapper.java:317)
[junit4] > at
sun.nio.ch.EPollSelectorImpl.wakeup(EPollSelectorImpl.java:193)
[junit4] > at
java.nio.channels.spi.AbstractSelector$1.interrupt(AbstractSelector.java:213)
[junit4] > at
java.nio.channels.spi.AbstractSelector.begin(AbstractSelector.java:219)
[junit4] > at
sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:78)
[junit4] > at
sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:86)
[junit4] > at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:97)
[junit4] > at
org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:173)
[junit4] > at java.lang.Thread.run(Thread.java:745)
[junit4] > at
__randomizedtesting.SeedInfo.seed([ACC627484AE10FE0]:0)
[junit4] Completed [100/580 (1!)] on J12 in 70.76s, 1 test, 3 errors <<<
FAILURES!
{noformat}
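To check whether other failing suites show the same pattern, a log scan along
these lines could help. This is only a sketch under assumptions: the class and
method names (MBeanFailureScan, suitesWithMBeanFailure) are hypothetical and not
part of Solr or ZooKeeper, and it assumes the junit4 output format shown above,
where each suite starts with a "[junit4] Suite:" line and stack frames may be
wrapped onto their own lines.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: lists suites whose output contains a
// java.lang.AssertionError followed later by a MBeanRegistry.register frame.
class MBeanFailureScan {
    // Matches the suite header line emitted by the junit4 runner.
    private static final Pattern SUITE =
        Pattern.compile("\\[junit4\\]\\s+Suite:\\s+(\\S+)");
    // Stack frame seen in the failure above.
    private static final String FRAME =
        "org.apache.zookeeper.jmx.MBeanRegistry.register";

    static List<String> suitesWithMBeanFailure(List<String> lines) {
        List<String> hits = new ArrayList<>();
        String current = null;        // suite whose output is being read
        boolean sawAssertion = false; // AssertionError seen in this suite
        for (String line : lines) {
            Matcher m = SUITE.matcher(line);
            if (m.find()) {
                // A new suite starts: reset per-suite state.
                current = m.group(1);
                sawAssertion = false;
                continue;
            }
            if (line.contains("java.lang.AssertionError")) {
                sawAssertion = true;
            } else if (sawAssertion && current != null
                    && line.contains(FRAME) && !hits.contains(current)) {
                hits.add(current);
            }
        }
        return hits;
    }
}
```

The input could come from java.nio.file.Files.readAllLines on a saved test
log. The match is deliberately loose (any register frame after an
AssertionError within the same suite), so a hit is a hint to inspect that
suite's output, not proof of the same root cause.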
> Upgrade Zookeeper to 3.4.8
> --------------------------
>
> Key: SOLR-8724
> URL: https://issues.apache.org/jira/browse/SOLR-8724
> Project: Solr
> Issue Type: Bug
> Reporter: Steve Rowe
> Fix For: master
>
> Attachments: SOLR-8724.patch, SOLR-8724.patch, SOLR-8724.patch
>
>
> Zookeeper 3.4.8 was released a few days ago - we should upgrade.