[jira] [Commented] (HIVE-8042) Optionally allow move tasks to run in parallel

Gunther Hagleitner (JIRA) Wed, 10 Sep 2014 12:54:55 -0700

    [ 
https://issues.apache.org/jira/browse/HIVE-8042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129001#comment-14129001
 ]


Gunther Hagleitner commented on HIVE-8042:
------------------------------------------

[~xuefuz] I'm torn about this one. I don't like the large number of configs we 
have, but this is changing the behavior of hive.exec.parallel and I wanted to 
make sure folks can fall back to the old behavior. Bug defense and maybe 
situations where you don't want to do that much work in parallel on the client 
machine (e.g.: HS2).

> Optionally allow move tasks to run in parallel
> ----------------------------------------------
>
>                 Key: HIVE-8042
>                 URL: https://issues.apache.org/jira/browse/HIVE-8042
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Gunther Hagleitner
>            Assignee: Gunther Hagleitner
>         Attachments: HIVE-8042.1.patch
>
>
> hive.exec.parallel allows one to run different stages of a query in parallel. 
> However that applies only to map-reduce tasks. When using large multi insert 
> queries there are many MoveTasks that are all executed in sequence on the 
> client. There's no real reason for that - they could be run in parallel as 
> well (i.e.: the stage graph captures the dependencies and knows which tasks 
> can happen in parallel)
> Proposal here is to create a new config var:
> hive.exec.parallel.movetask
> When set to true it run movetasks in parallel also.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-8042) Optionally allow move tasks to run in parallel

Reply via email to