[ https://issues.apache.org/jira/browse/HIVE-1642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12932666#action_12932666 ]
Namit Jain commented on HIVE-1642: ---------------------------------- hive-default.xml 477 <property> 478 <name>hive.mapjoin.hashtable.threshold</name> 479 <value>100000</value> 480 <description>the threshold for the mapjoin hashtable</description> 481 </property> 482 483 <property> 484 <name>hive.mapjoin.hashtable.loadfactor</name> 485 <value>0.75</value> 486 <description>the load factor for the mapjoin hashtable</description> 487 </property> 488 489 <property> 490 <name>hive.mapjoin.smalltable.filesize</name> 491 <value>25000000</value> 492 <description>The threshold for the input file size of the small tables; if the file size is smaller than this threshold, it will try to concert the common join into map join</description> 493 </property> 494 495 <property> 496 <name>hive.mapjoin.localtask.max.memory.usage</name> 497 <value>0.90</value> 498 <description>The max memory usage of the local task for map join</description> 499 </property> 500 Add more comments for the 1,2 and 4 properties. spelling mistake in the third: concert -> convert Uncheckout DriverContext.java Why should backup task be obtained from the resolver ? It can be created at task creation time itself ? > Convert join queries to map-join based on size of table/row > ----------------------------------------------------------- > > Key: HIVE-1642 > URL: https://issues.apache.org/jira/browse/HIVE-1642 > Project: Hive > Issue Type: Improvement > Components: Query Processor > Reporter: Namit Jain > Assignee: Liyin Tang > Fix For: 0.7.0 > > Attachments: hive_1642_1.patch, hive_1642_2.patch, hive_1642_4.patch > > > Based on the number of rows and size of each table, Hive should automatically > be able to convert a join into map-join. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.