[ https://issues.apache.org/jira/browse/HIVE-5973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13843946#comment-13843946 ]
Vikram Dixit K commented on HIVE-5973:
--------------------------------------

It is quite easy to reproduce this on a cluster but I haven't had success with our unit tests. I will come up with one and post it here.

Thanks
Vikram.

--
Nothing better than when appreciated for hard work.
-Mark

> SMB joins produce incorrect results with multiple partitions and buckets
> ------------------------------------------------------------------------
>
>                 Key: HIVE-5973
>                 URL: https://issues.apache.org/jira/browse/HIVE-5973
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.13.0
>            Reporter: Vikram Dixit K
>            Assignee: Vikram Dixit K
>             Fix For: 0.13.0
>
>
> It looks like there is an issue with re-using the output object array in the
> select operator. When we read rows of the non-big tables, we hold on to the
> output object in the priority queue. This causes hive to produce incorrect
> results because all the elements in the priority queue refer to the same
> object and the join happens on only one of the buckets.
> {noformat}
> output[i] = eval[i].evaluate(row);
> {noformat}

--
This message was sent by Atlassian JIRA
(v6.1.4#6159)
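
The aliasing described in the issue can be illustrated outside Hive with a minimal sketch. Everything here is hypothetical and for illustration only: the class `ReusedRowDemo`, the methods `evaluate` and `drain`, and the `copy` flag are not Hive code. The shared `output` array stands in for the select operator's reused output object. Copying the row before enqueueing is shown as one possible remedy, not necessarily the fix applied for HIVE-5973.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

final class ReusedRowDemo {
    // Hypothetical stand-in for the select operator's reused output array.
    private static final Object[] output = new Object[1];

    // Mirrors the pattern output[i] = eval[i].evaluate(row):
    // the same array instance is overwritten and returned on every call.
    static Object[] evaluate(int key) {
        output[0] = key;
        return output;
    }

    // Enqueues one row per key, then drains the queue in key order.
    // With copy == false the queue holds several references to one array.
    static List<Integer> drain(int[] keys, boolean copy) {
        PriorityQueue<Object[]> queue = new PriorityQueue<>(
                Comparator.comparingInt((Object[] r) -> (Integer) r[0]));
        for (int key : keys) {
            Object[] row = evaluate(key);
            // Assumed remedy: take a private copy before holding on to the row.
            queue.add(copy ? Arrays.copyOf(row, row.length) : row);
        }
        List<Integer> result = new ArrayList<>();
        while (!queue.isEmpty()) {
            result.add((Integer) queue.poll()[0]);
        }
        return result;
    }

    public static void main(String[] args) {
        int[] bucketKeys = {3, 1, 2};
        System.out.println(drain(bucketKeys, false)); // prints [2, 2, 2]
        System.out.println(drain(bucketKeys, true));  // prints [1, 2, 3]
    }
}
```

Without the copy, every queue element reflects the last row evaluated, which matches the symptom in the description: all elements in the priority queue refer to the same object, so only one bucket's row is effectively seen.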