[ https://issues.apache.org/jira/browse/HIVE-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Phabricator updated HIVE-2466: ------------------------------ Attachment: D285.1.patch njain requested code review of "HIVE-2466 [jira] mapjoin_subquery dump small table (mapjoin table) to the same file". Reviewers: JIRA HIVE-2466 diff for review in mapjoin_subquery.q there is a query: SELECT /*+ MAPJOIN(z) */ subq.key1, z.value FROM (SELECT /*+ MAPJOIN<img class="emoticon" src="https://issues.apache.org/jira/images/icons/emoticons/error.gif" height="16" width="16" align="absmiddle" alt="" border="0"/> */ x.key as key1, x.value as value1, y.key as key2, y.value as value2 FROM src1 x JOIN src y ON (x.key = y.key)) subq JOIN srcpart z ON (subq.key1 = z.key and z.ds='2008-04-08' and z.hr=11); when dump x and z to a local file,there all dump to the same file, so we lost the data of x TEST PLAN EMPTY REVISION DETAIL https://reviews.facebook.net/D285 AFFECTED FILES data/files/x.txt data/files/y.txt data/files/z.txt ql/src/test/results/clientpositive/mapjoin_subquery2.q.out ql/src/test/queries/clientpositive/mapjoin_subquery2.q ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/GenMRSkewJoinProcessor.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/MapJoinProcessor.java ql/src/java/org/apache/hadoop/hive/ql/exec/HashTableSinkOperator.java ql/src/java/org/apache/hadoop/hive/ql/exec/MapredLocalTask.java.orig ql/src/java/org/apache/hadoop/hive/ql/exec/MapredLocalTask.java ql/src/java/org/apache/hadoop/hive/ql/exec/HashTableSinkOperator.java.orig ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java ql/src/java/org/apache/hadoop/hive/ql/plan/HashTableSinkDesc.java ql/src/java/org/apache/hadoop/hive/ql/plan/PlanUtils.java ql/src/java/org/apache/hadoop/hive/ql/plan/MapJoinDesc.java MANAGE HERALD DIFFERENTIAL RULES https://reviews.facebook.net/herald/view/differential/ WHY DID I GET THIS EMAIL? https://reviews.facebook.net/herald/transcript/591/ Tip: use the X-Herald-Rules header to filter Herald messages in your client. > mapjoin_subquery dump small table (mapjoin table) to the same file > ------------------------------------------------------------------- > > Key: HIVE-2466 > URL: https://issues.apache.org/jira/browse/HIVE-2466 > Project: Hive > Issue Type: Bug > Components: Query Processor > Affects Versions: 0.7.1 > Reporter: binlijin > Assignee: binlijin > Priority: Blocker > Attachments: D285.1.patch, hive-2466.1.patch, hive-2466.2.patch, > hive-2466.3.patch > > > in mapjoin_subquery.q there is a query: > SELECT /*+ MAPJOIN(z) */ subq.key1, z.value > FROM > (SELECT /*+ MAPJOIN(x) */ x.key as key1, x.value as value1, y.key as key2, > y.value as value2 > FROM src1 x JOIN src y ON (x.key = y.key)) subq > JOIN srcpart z ON (subq.key1 = z.key and z.ds='2008-04-08' and z.hr=11); > when dump x and z to a local file,there all dump to the same file, so we lost > the data of x -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira