[ 
https://issues.apache.org/jira/browse/HIVE-24328?focusedWorklogId=509490&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-509490
 ]

ASF GitHub Bot logged work on HIVE-24328:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 10/Nov/20 04:34
            Start Date: 10/Nov/20 04:34
    Worklog Time Spent: 10m 
      Work Description: aasha commented on a change in pull request #1648:
URL: https://github.com/apache/hive/pull/1648#discussion_r520280447



##########
File path: common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
##########
@@ -647,6 +647,9 @@ private static void populateLlapDaemonVarsSet(Set<String> llapDaemonVarsSetLocal
       "Provide the maximum number of partitions of a table that will be batched together during  \n"
         + "repl load. All the partitions in a batch will make a single metastore call to update the metadata. \n"
         + "The data for these partitions will be copied before copying the metadata batch. "),
+    REPL_PARALLEL_COPY_TASKS("hive.repl.parallel.copy.tasks",1000,

Review comment:
       This is kept configurable so that customers can set it according to their 
needs. Threads are created only if there are that many files to copy in 
parallel. The method uses the common pool, which is shut down immediately 
after the task is complete. Even if multiple threads are created, they are 
just worker threads that shut down once the tasks are done. Customers have 
to set this config based on their memory allocations.
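
A minimal sketch of the behavior described above, not the code from the pull request: the pool is sized by the smaller of the configured limit and the number of files, and it is shut down as soon as the copy tasks finish. The class ParallelCopySketch, the methods poolSize and copyAll, and the use of a fixed thread pool are assumptions made for illustration; the actual copy step (distcp) is replaced by a placeholder.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical illustration only; names do not come from the Hive code base.
final class ParallelCopySketch {

  // Cap the pool by both the configured limit (e.g. hive.repl.parallel.copy.tasks)
  // and the number of files, so no more threads exist than there are files to copy.
  static int poolSize(int configuredMaxTasks, int fileCount) {
    return Math.max(1, Math.min(configuredMaxTasks, fileCount));
  }

  // Runs one copy task per file on a short-lived pool and returns the number
  // of tasks that completed. The pool is always shut down before returning.
  static int copyAll(List<String> files, int configuredMaxTasks) {
    if (files.isEmpty()) {
      return 0;
    }
    ExecutorService pool =
        Executors.newFixedThreadPool(poolSize(configuredMaxTasks, files.size()));
    try {
      List<Callable<String>> tasks = new ArrayList<>();
      for (String file : files) {
        tasks.add(() -> file); // placeholder for the real copy of this file
      }
      int done = 0;
      for (Future<String> result : pool.invokeAll(tasks)) {
        result.get(); // surfaces any failure from the task
        done++;
      }
      return done;
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IllegalStateException("copy interrupted", e);
    } catch (ExecutionException e) {
      throw new IllegalStateException("copy task failed", e);
    } finally {
      pool.shutdown(); // worker threads exit once the tasks are done
    }
  }
}
```

With the value 1000 shown in the diff and only three files to copy, poolSize returns 3 in this sketch, so a large configured value only costs memory when there are that many files.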




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 509490)
    Time Spent: 1h  (was: 50m)

> Run distcp in parallel for all file entries in repl load.
> ---------------------------------------------------------
>
>                 Key: HIVE-24328
>                 URL: https://issues.apache.org/jira/browse/HIVE-24328
>             Project: Hive
>          Issue Type: Task
>            Reporter: Aasha Medhi
>            Assignee: Aasha Medhi
>            Priority: Major
>              Labels: pull-request-available
>         Attachments: HIVE-24328.01.patch
>
>          Time Spent: 1h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)
