[ https://issues.apache.org/jira/browse/HIVE-15879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15884045#comment-15884045 ]
Hive QA commented on HIVE-15879: -------------------------------- Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12854556/HIVE-15879.04.patch {color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified. {color:red}ERROR:{color} -1 due to 3 failed/errored test(s), 10265 tests executed *Failed tests:* {noformat} TestDerbyConnector - did not produce a TEST-*.xml file (likely timed out) (batchId=235) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query14] (batchId=223) org.apache.hive.beeline.TestBeeLineWithArgs.testQueryProgressParallel (batchId=211) {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/3771/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/3771/console Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-3771/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 3 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12854556 - PreCommit-HIVE-Build > Fix HiveMetaStoreChecker.checkPartitionDirs method > -------------------------------------------------- > > Key: HIVE-15879 > URL: https://issues.apache.org/jira/browse/HIVE-15879 > Project: Hive > Issue Type: Bug > Reporter: Vihang Karajgaonkar > Assignee: Vihang Karajgaonkar > Attachments: HIVE-15879.01.patch, HIVE-15879.02.patch, > HIVE-15879.03.patch, HIVE-15879.04.patch > > > HIVE-15803 fixes the msck hang issue in > HiveMetaStoreChecker.checkPartitionDirs method by adding a check to see if > the Threadpool has any spare threads. If not it uses single threaded listing > of the files. > {noformat} > if (pool != null) { > synchronized (pool) { > // In case of recursive calls, it is possible to deadlock with TP. > Check TP usage here. > if (pool.getActiveCount() < pool.getMaximumPoolSize()) { > useThreadPool = true; > } > if (!useThreadPool) { > if (LOG.isDebugEnabled()) { > LOG.debug("Not using threadPool as active count:" + > pool.getActiveCount() > + ", max:" + pool.getMaximumPoolSize()); > } > } > } > } > {noformat} > Based on the java doc of getActiveCount() below > bq. Returns the approximate number of threads that are actively executing > tasks. > it returns only approximate number of threads and it cannot be guaranteed > that it always returns the exact number of active threads. This still exposes > the method implementation to the msck hang bug in rare corner cases. > We could either: > 1. Use a atomic counter to track exactly how many threads are actively running > 2. Relook at the method itself to make it much simpler. Like eg, look into > the possibility of changing the recursive implementation to an iterative > implementation where worker threads pick tasks from a queue until the queue > is empty. -- This message was sent by Atlassian JIRA (v6.3.15#6346)