[ 
https://issues.apache.org/jira/browse/HIVE-8622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14202496#comment-14202496
 ] 

Suhas Satish commented on HIVE-8622:
------------------------------------

[~csun] - We already have a map from the BaseWork containing the map-join to 
its parent ReduceSinks. 
This exists as {{linkWorkWithReduceSinkMap}} in {{GenSparkProcContext}}.

Do you think we can leverage that in some way, or replace the RSs in that map 
with the HashTableSinks that we introduced? It looks like we should still 
propagate the whole {{GenSparkProcContext}} to the {{SparkMapJoinResolver}} 
through {{SparkCompiler.generateTaskTree(...)}} and 
{{SparkCompiler.optimizeTaskPlan}}.

All the state information stored there will make life a lot easier. 
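
To make the "replace the RSs in that map" idea concrete, here is a rough sketch. 
It assumes the map has the shape work -> list of parent ReduceSinks. The types 
{{Work}}, {{ReduceSink}} and {{HashTableSink}}, and the method 
{{replaceWithHashTableSinks}}, are hypothetical stand-ins written only for this 
illustration; they are not the real Hive classes or an existing API.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustration only: every type here is a hypothetical stand-in, not a Hive class.
final class MapJoinLinkSketch {

    // Stand-in for the work that contains the map-join.
    static final class Work {
        final String name;
        Work(String name) { this.name = name; }
    }

    // Stand-in for a parent ReduceSink operator.
    static final class ReduceSink {
        final String name;
        ReduceSink(String name) { this.name = name; }
    }

    // Stand-in for the HashTableSink that would take the ReduceSink's place.
    static final class HashTableSink {
        final String name;
        HashTableSink(String name) { this.name = name; }
    }

    // Builds a new map with the same keys, where each parent ReduceSink is
    // represented by a HashTableSink derived from it. The input map is not modified.
    static Map<Work, List<HashTableSink>> replaceWithHashTableSinks(
            Map<Work, List<ReduceSink>> linkWorkWithReduceSinkMap) {
        Map<Work, List<HashTableSink>> result = new LinkedHashMap<>();
        for (Map.Entry<Work, List<ReduceSink>> entry : linkWorkWithReduceSinkMap.entrySet()) {
            List<HashTableSink> sinks = new ArrayList<>();
            for (ReduceSink rs : entry.getValue()) {
                sinks.add(new HashTableSink("HTS[" + rs.name + "]"));
            }
            result.put(entry.getKey(), sinks);
        }
        return result;
    }

    public static void main(String[] args) {
        Work mapJoinWork = new Work("Map 2");
        List<ReduceSink> parents = new ArrayList<>();
        parents.add(new ReduceSink("RS_1"));
        parents.add(new ReduceSink("RS_2"));

        Map<Work, List<ReduceSink>> links = new LinkedHashMap<>();
        links.put(mapJoinWork, parents);

        // Print each work together with the sinks that now feed it.
        for (Map.Entry<Work, List<HashTableSink>> e : replaceWithHashTableSinks(links).entrySet()) {
            for (HashTableSink hts : e.getValue()) {
                System.out.println(e.getKey().name + " <- " + hts.name);
            }
        }
    }
}
```

Whether the resolver should rewrite the existing map in place or build a second 
map like this is an open question; the sketch builds a copy only so that the 
original ReduceSink links stay available.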

> Split map-join plan into 2 SparkTasks in 3 stages [Spark Branch]
> ----------------------------------------------------------------
>
>                 Key: HIVE-8622
>                 URL: https://issues.apache.org/jira/browse/HIVE-8622
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Suhas Satish
>            Assignee: Chao
>         Attachments: HIVE-8622.2-spark.patch, HIVE-8622.3-spark.patch, 
> HIVE-8622.patch
>
>
> This is a sub-task of map-join for spark 
> https://issues.apache.org/jira/browse/HIVE-7613
> This can use the baseline patch for map-join
> https://issues.apache.org/jira/browse/HIVE-8616



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
