[
https://issues.apache.org/jira/browse/SOLR-8456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15533900#comment-15533900
]
Steve Rowe commented on SOLR-8456:
----------------------------------
I see still see non-reproducing failures on my Jenkins about once a day, mostly
like these two, with closed sockets (neither of which is still present on the
other Jenkins servers) - these don't seem to be related to timeouts, so maybe a
separate issue is warranted?:
>From July 10th:
{noformat}
Build: https://builds.apache.org/job/Lucene-Solr-NightlyTests-6.x/116/
6 tests failed.
FAILED: org.apache.solr.cloud.HttpPartitionTest.test
Error Message:
Captured an uncaught exception in thread: Thread[id=25190,
name=SocketProxy-Response-37204:54204, state=RUNNABLE,
group=TGRP-HttpPartitionTest]
Stack Trace:
com.carrotsearch.randomizedtesting.UncaughtExceptionError: Captured an uncaught
exception in thread: Thread[id=25190, name=SocketProxy-Response-37204:54204,
state=RUNNABLE, group=TGRP-HttpPartitionTest]
at
__randomizedtesting.SeedInfo.seed([56EEEACA37CCC526:DEBAD5109930A8DE]:0)
Caused by: java.lang.RuntimeException: java.net.SocketException: Socket is
closed
at __randomizedtesting.SeedInfo.seed([56EEEACA37CCC526]:0)
at
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:347)
Caused by: java.net.SocketException: Socket is closed
at java.net.Socket.setSoTimeout(Socket.java:1137)
at
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:344)
{noformat}
>From August 26th:
{noformat}
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-master-Windows/6078/
Java: 32bit/jdk1.8.0_102 -server -XX:+UseSerialGC
6 tests failed.
FAILED: org.apache.solr.cloud.HttpPartitionTest.test
Error Message:
Captured an uncaught exception in thread: Thread[id=6241,
name=SocketProxy-Request-60378:60020, state=RUNNABLE,
group=TGRP-HttpPartitionTest]
Stack Trace:
com.carrotsearch.randomizedtesting.UncaughtExceptionError: Captured an uncaught
exception in thread: Thread[id=6241, name=SocketProxy-Request-60378:60020,
state=RUNNABLE, group=TGRP-HttpPartitionTest]
Caused by: java.lang.RuntimeException: java.net.SocketException: Socket is
closed
at __randomizedtesting.SeedInfo.seed([25A5A0B6B8584DC6]:0)
at
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:347)
Caused by: java.net.SocketException: Socket is closed
at java.net.Socket.setSoTimeout(Socket.java:1137)
at
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:344)
{noformat}
Here's one from yesterday from my Jenkins
[http://jenkins.sarowe.net/job/Lucene-Solr-tests-6.x/2919]:
{noformat}
[junit4] 2> 284268 ERROR
(zkCallback-176-thread-1-processing-n:127.0.0.1:46806_ug_)
[n:127.0.0.1:46806_ug_ c:collMinRf_1x3 s:shard1 r:core_node2
x:collMinRf_1x3_shard1_replica2] o.a.s.c.ShardLeaderElectionContext There was a
problem trying to register as the leader:java.lang.NullPointerException
[junit4] 2> at
org.apache.solr.cloud.ShardLeaderElectionContext.runLeaderProcess(ElectionContext.java:422)
[junit4] 2> at
org.apache.solr.cloud.LeaderElector.runIamLeaderProcess(LeaderElector.java:170)
[junit4] 2> at
org.apache.solr.cloud.LeaderElector.checkIfIamLeader(LeaderElector.java:135)
[junit4] 2> at
org.apache.solr.cloud.LeaderElector.access$200(LeaderElector.java:56)
[junit4] 2> at
org.apache.solr.cloud.LeaderElector$ElectionWatcher.process(LeaderElector.java:348)
[junit4] 2> at
org.apache.solr.common.cloud.SolrZkClient$3.lambda$process$0(SolrZkClient.java:268)
[junit4] 2> at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
[junit4] 2> at
java.util.concurrent.FutureTask.run(FutureTask.java:266)
[junit4] 2> at
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:229)
[junit4] 2> at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
[junit4] 2> at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
[junit4] 2> at java.lang.Thread.run(Thread.java:745)
[...]
[junit4] 2> NOTE: reproduce with: ant test -Dtestcase=HttpPartitionTest
-Dtests.method=test -Dtests.seed=BD012B32E5FEC12B -Dtests.slow=true
-Dtests.locale=de-AT -Dtests.timezone=Asia/Bishkek -Dtests.asserts=true
-Dtests.file.encoding=US-ASCII
[junit4] ERROR 167s J3 | HttpPartitionTest.test <<<
[junit4] > Throwable #1:
com.carrotsearch.randomizedtesting.UncaughtExceptionError: Captured an uncaught
exception in thread: Thread[id=1344, name=SocketProxy-Response-60387:59366,
state=RUNNABLE, group=TGRP-HttpPartitionTest]
[junit4] > at
__randomizedtesting.SeedInfo.seed([BD012B32E5FEC12B:355514E84B02ACD3]:0)
[junit4] > Caused by: java.lang.RuntimeException:
java.net.SocketException: Socket is closed
[junit4] > at
__randomizedtesting.SeedInfo.seed([BD012B32E5FEC12B]:0)
[junit4] > at
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:347)
[junit4] > Caused by: java.net.SocketException: Socket is closed
[junit4] > at java.net.Socket.setSoTimeout(Socket.java:1137)
[junit4] > at
org.apache.solr.cloud.SocketProxy$Bridge$Pump.run(SocketProxy.java:344)
[junit4] 2> 284534 INFO
(SUITE-HttpPartitionTest-seed#[BD012B32E5FEC12B]-worker) [ ]
o.a.s.SolrTestCaseJ4 ###deleteCore
[junit4] 2> NOTE: test params are: codec=Asserting(Lucene62):
{range_facet_l_dv=Lucene50(blocksize=128), _version_=FSTOrd50,
multiDefault=PostingsFormat(name=Memory doPackFST= true), a_t=FSTOrd50,
intDefault=FSTOrd50, id=Lucene50(blocksize=128), range_facet_i_dv=FSTOrd50,
text=PostingsFormat(name=Memory doPackFST= false), range_facet_l=FSTOrd50,
timestamp=FSTOrd50},
docValues:{range_facet_l_dv=DocValuesFormat(name=Lucene54),
range_facet_i_dv=DocValuesFormat(name=Lucene54),
timestamp=DocValuesFormat(name=Lucene54)}, maxPointsInLeafNode=256,
maxMBSortInHeap=5.671099811904124,
sim=RandomSimilarity(queryNorm=false,coord=yes): {}, locale=de-AT,
timezone=Asia/Bishkek
[junit4] 2> NOTE: Linux 4.1.0-custom2-amd64 amd64/Oracle Corporation
1.8.0_77 (64-bit)/cpus=16,threads=1,free=351228904,total=524812288
[junit4] 2> NOTE: All tests run in this JVM: [AutoCommitTest,
BlockJoinFacetDistribTest, HdfsDirectoryFactoryTest, CopyFieldTest,
DocExpirationUpdateProcessorFactoryTest, ZkStateReaderTest,
PathHierarchyTokenizerFactoryTest, DateFieldTest, TestTestInjection,
TestSolrQueryParser, TestConfigSetsAPI, HttpPartitionTest]
[junit4] Completed [184/635 (1!)] on J3 in 167.39s, 1 test, 1 error <<<
FAILURES!
{noformat}
> Investigate org.apache.solr.cloud.HttpPartitionTest.test failing more
> commonly.
> -------------------------------------------------------------------------------
>
> Key: SOLR-8456
> URL: https://issues.apache.org/jira/browse/SOLR-8456
> Project: Solr
> Issue Type: Test
> Reporter: Mark Miller
> Assignee: Timothy Potter
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]