[ https://issues.apache.org/jira/browse/HIVE-8638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jimmy Xiang updated HIVE-8638:
------------------------------
    Attachment:     (was: HIVE-8638.1-spark.patch)

> Implement bucket map join optimization [Spark Branch]
> -----------------------------------------------------
>
>                 Key: HIVE-8638
>                 URL: https://issues.apache.org/jira/browse/HIVE-8638
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Na Yang
>            Assignee: Jimmy Xiang
>
> In the hive-on-mr implementation, the bucket map join optimization depends
> on the map join hint. In the hive-on-tez implementation, by contrast, a join
> can be automatically converted to a bucket map join if certain conditions
> are met, such as:
> 1. the optimization flag hive.convert.join.bucket.mapjoin.tez is ON
> 2. all join tables are bucketed, and each small table's bucket number can be
> divided by the big table's bucket number
> 3. bucket columns == join columns
> In the hive-on-spark implementation, it would be ideal to support the same
> automatic conversion: when all the required criteria are met, a join is
> automatically converted to a bucket map join.
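
The three conditions listed in the issue description can be sketched as a single eligibility check. This is only an illustration under stated assumptions, not Hive's actual optimizer code: the class BucketMapJoinCheck, the TableInfo type, and the canConvert method are hypothetical names. Two interpretations are also assumed: bucket counts are treated as compatible when the larger is a multiple of the smaller (the issue text words this in one direction only), and "bucket columns == join columns" is compared as sets, ignoring order.

```java
import java.util.HashSet;
import java.util.List;

/** Hypothetical helper for illustration; not part of Hive. */
final class BucketMapJoinCheck {

    /** Minimal, illustrative description of one join input. */
    static final class TableInfo {
        final int numBuckets;            // 0 means the table is not bucketed
        final List<String> bucketCols;   // columns the table is bucketed on
        final List<String> joinCols;     // columns this table contributes to the join key

        TableInfo(int numBuckets, List<String> bucketCols, List<String> joinCols) {
            this.numBuckets = numBuckets;
            this.bucketCols = bucketCols;
            this.joinCols = joinCols;
        }
    }

    /** Returns true when all three conditions from the issue description hold. */
    static boolean canConvert(boolean flagOn, TableInfo big, List<TableInfo> smalls) {
        // Condition 1: the optimization flag must be on.
        if (!flagOn) {
            return false;
        }
        // Conditions 2 and 3 for the big table: bucketed, and bucketed on the join columns.
        if (!bucketedOnJoinCols(big)) {
            return false;
        }
        for (TableInfo small : smalls) {
            // Conditions 2 and 3 for each small table.
            if (!bucketedOnJoinCols(small)) {
                return false;
            }
            // Condition 2: bucket counts must divide evenly (assumed: in either direction).
            if (!compatibleBucketCounts(big.numBuckets, small.numBuckets)) {
                return false;
            }
        }
        return true;
    }

    private static boolean bucketedOnJoinCols(TableInfo t) {
        // Assumption: column lists are compared as sets, ignoring order.
        return t.numBuckets > 0
            && new HashSet<>(t.bucketCols).equals(new HashSet<>(t.joinCols));
    }

    private static boolean compatibleBucketCounts(int a, int b) {
        // Both counts are positive here because bucketedOnJoinCols ran first.
        int larger = Math.max(a, b);
        int smaller = Math.min(a, b);
        return larger % smaller == 0;
    }
}
```

With a big table of 4 buckets on column id, a small table of 2 buckets on id passes the check, while a small table of 3 buckets, or one bucketed on a different column, does not.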