[ 
https://issues.apache.org/jira/browse/HIVE-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Carl Steinbach updated HIVE-1047:
---------------------------------

    Component/s: Query Processor
     Issue Type: Improvement  (was: Bug)

> Merge tasks in GenMRUnion1
> --------------------------
>
>                 Key: HIVE-1047
>                 URL: https://issues.apache.org/jira/browse/HIVE-1047
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.6.0
>            Reporter: Ning Zhang
>            Assignee: Ning Zhang
>             Fix For: 0.6.0
>
>         Attachments: HIVE-1047.patch
>
>
> In the following query:
> from (select * from src  union all select * from src) s
> insert overwrite table src_multi1 select * where key < 10
> insert overwrite table src_multi2 select * where key > 10 and key < 20;
> There are two topOps (TableScaneOperator) for the same MapRed task. In 
> genTableScan1, each TableScanOperator will create a new task as currTask. The 
> genMRUnion1 should merge two tasks into one. Currently GenMRUnion1 does not 
> merge currTask, this will cause down stream operators like genFileSink1 to  
> do some hacks to effectively merge the two tasks. A cleaner way is to merge 
> the tasks in GenMRUnion1 as done by join operators etc. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to