[ 
https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15257765#comment-15257765
 ] 

Rui Li commented on HIVE-13572:
-------------------------------

Thanks [~ashutoshc] for the suggestion. The v2 patch sets status using 
threadpool, which provided better performance in my local tests.
I'll come up with another patch to do it in FS. One concern is that we may 
unnecessarily set the permission on intermediate data this way. We can decide 
which way to go when the patch is ready.

> Redundant setting full file status in Hive::copyFiles
> -----------------------------------------------------
>
>                 Key: HIVE-13572
>                 URL: https://issues.apache.org/jira/browse/HIVE-13572
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Rui Li
>            Assignee: Rui Li
>         Attachments: HIVE-13572.1.patch, HIVE-13572.2.patch
>
>
> We set full file status in each copy-file thread. I think it's redundant and 
> hurts performance when we have multiple files to copy.
> {code}
>             if (inheritPerms) {
>               ShimLoader.getHadoopShims().setFullFileStatus(conf, 
> fullDestStatus, destFs, destf);
>             }
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to