[ https://issues.apache.org/jira/browse/HIVE-8616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Suhas Satish updated HIVE-8616: ------------------------------- Status: Patch Available (was: Open) Attached a patch which addresses this sub-task. With this patch applied, this is the explain plan for a 3-way join. explain select * from table1 join table2 on (table1.key = table2.key) join table3 on table1.key = table3.key; OK STAGE DEPENDENCIES: Stage-1 is a root stage Stage-0 depends on stages: Stage-1 STAGE PLANS: Stage: Stage-1 Spark Edges: Map 1 <- Map 2 (NONE, 0), Map 3 (NONE, 0) DagName: ssatish_20141027131919_0ab004f6-5495-44b4-b7b1-16bf8ca15473:2 Vertices: Map 1 Map Operator Tree: TableScan alias: table1 Statistics: Num rows: 55 Data size: 5812 Basic stats: COMPLETE Column stats: NONE Filter Operator predicate: key is not null (type: boolean) Statistics: Num rows: 28 Data size: 2958 Basic stats: COMPLETE Column stats: NONE Map Join Operator condition map: Inner Join 0 to 1 Inner Join 0 to 2 condition expressions: 0 {key} {value} 1 {key} {value} 2 {key} {value} keys: 0 key (type: int) 1 key (type: int) 2 key (type: int) outputColumnNames: _col0, _col1, _col5, _col6, _col10, _col11 input vertices: 1 Map 3 2 Map 2 Statistics: Num rows: 61 Data size: 6507 Basic stats: COMPLETE Column stats: NONE Select Operator expressions: _col0 (type: int), _col1 (type: string), _col5 (type: int), _col6 (type: string), _col10 (type: int), _col11 (type: string) outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5 Statistics: Num rows: 61 Data size: 6507 Basic stats: COMPLETE Column stats: NONE File Output Operator compressed: false Statistics: Num rows: 61 Data size: 6507 Basic stats: COMPLETE Column stats: NONE table: input format: org.apache.hadoop.mapred.TextInputFormat output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe Map 2 Map Operator Tree: TableScan alias: table3 Statistics: Num rows: 1 Data size: 140 Basic stats: COMPLETE Column stats: NONE Filter Operator predicate: key is not null (type: boolean) Statistics: Num rows: 1 Data size: 140 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: key (type: int) sort order: + Map-reduce partition columns: key (type: int) Statistics: Num rows: 1 Data size: 140 Basic stats: COMPLETE Column stats: NONE value expressions: value (type: string) Map 3 Map Operator Tree: TableScan alias: table2 Statistics: Num rows: 55 Data size: 5791 Basic stats: COMPLETE Column stats: NONE Filter Operator predicate: key is not null (type: boolean) Statistics: Num rows: 28 Data size: 2948 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: key (type: int) sort order: + Map-reduce partition columns: key (type: int) Statistics: Num rows: 28 Data size: 2948 Basic stats: COMPLETE Column stats: NONE value expressions: value (type: string) Stage: Stage-0 Fetch Operator limit: -1 Processor Tree: ListSink > convert joinOp to MapJoinOp and generate MapWorks only > ------------------------------------------------------ > > Key: HIVE-8616 > URL: https://issues.apache.org/jira/browse/HIVE-8616 > Project: Hive > Issue Type: Sub-task > Components: Spark > Reporter: Suhas Satish > Assignee: Suhas Satish > Attachments: HIVE-8616-spark.patch > > > This is a sub-task of map join on spark. > The parent jira is > https://issues.apache.org/jira/browse/HIVE-7613 -- This message was sent by Atlassian JIRA (v6.3.4#6332)