[ https://issues.apache.org/jira/browse/HIVE-1641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12924812#action_12924812 ]
Namit Jain commented on HIVE-1641:
----------------------------------

Let us try to get it in; there are some other big patches waiting, and it will be difficult for you to maintain this.

> add map joined table to distributed cache
> -----------------------------------------
>
>                 Key: HIVE-1641
>                 URL: https://issues.apache.org/jira/browse/HIVE-1641
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.7.0
>            Reporter: Namit Jain
>            Assignee: Liyin Tang
>             Fix For: 0.7.0
>
>         Attachments: Hive-1641(3).txt, Hive-1641(4).patch, Hive-1641(5).patch, Hive-1641.patch
>
>
> Currently, the mappers read the map-joined table directly from HDFS, which makes it difficult to scale.
> We end up getting lots of timeouts once the number of mappers is beyond a few thousand, because so many mappers read concurrently.
> It would be a good idea to put the mapped file into the distributed cache and read from there instead.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.