[ https://issues.apache.org/jira/browse/HIVE-8639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14218168#comment-14218168 ]
Chinna Rao Lalam commented on HIVE-8639: ---------------------------------------- Hi [~brocknoland], I am investigating test failures. I need some time for this issue, If folks freeing up they can take it over. > Convert SMBJoin to MapJoin [Spark Branch] > ----------------------------------------- > > Key: HIVE-8639 > URL: https://issues.apache.org/jira/browse/HIVE-8639 > Project: Hive > Issue Type: Sub-task > Components: Spark > Reporter: Szehon Ho > Assignee: Chinna Rao Lalam > > HIVE-8202 supports auto-conversion of SMB Join. However, if the tables are > partitioned, there could be a slow down as each mapper would need to get a > very small chunk of a partition which has a single key. Thus, in some > scenarios it's beneficial to convert SMB join to map join. > The task is to research and support the conversion from SMB join to map join > for Spark execution engine. See the equivalent of MapReduce in > SortMergeJoinResolver. -- This message was sent by Atlassian JIRA (v6.3.4#6332)