[ 
https://issues.apache.org/jira/browse/HIVE-15803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15852245#comment-15852245
 ] 

Rajesh Balamohan commented on HIVE-15803:
-----------------------------------------

[~hagleitn] - It is not slow, it gets stuck.  

Another option is to use CachedThreadPool (which would spin up additional 
threads on need basis) instead of FixedThreadPool in executor service.  But 
again, if there are 50000 partitions with nesting, it would end up throwing 
"unable to create native thread".  Passing "null" (instead of pool) 
https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/metadata/HiveMetaStoreChecker.java#L499
 would ensure that only certain amount of threads are there doing the 
processing and any further nesting has to be handled in the same thread 
context. That would still improve the perf compared to no-thread pool scenario 
as 15 threads would be processing instead of 1.

> msck can hang when nested partitions are present
> ------------------------------------------------
>
>                 Key: HIVE-15803
>                 URL: https://issues.apache.org/jira/browse/HIVE-15803
>             Project: Hive
>          Issue Type: Bug
>          Components: Metastore
>            Reporter: Rajesh Balamohan
>            Assignee: Rajesh Balamohan
>            Priority: Minor
>
> Steps to reproduce. 
> {noformat}
> CREATE TABLE `repairtable`( `col` string) PARTITIONED BY (  `p1` string,  
> `p2` string)
> hive> dfs -mkdir -p /apps/hive/warehouse/test.db/repairtable/p1=c/p2=a/p3=b;
> hive> dfs -touchz 
> /apps/hive/warehouse/test.db/repairtable/p1=c/p2=a/p3=b/datafile;
> hive> set hive.mv.files.thread;
> hive.mv.files.thread=15
> hive> set hive.mv.files.thread=1;
> hive> MSCK TABLE repairtable;
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to