[jira] [Commented] (HIVE-5973) SMB joins produce incorrect results with multiple partitions and buckets

Vikram Dixit K (JIRA) Mon, 09 Dec 2013 21:04:33 -0800

    [ 
https://issues.apache.org/jira/browse/HIVE-5973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13843946#comment-13843946
 ]


Vikram Dixit K commented on HIVE-5973:
--------------------------------------

It is quite easy to reproduce this on a cluster but I haven't had success
with our unit tests. I will come up with one and post it here.

Thanks
Vikram.






-- 
Nothing better than when appreciated for hard work.
-Mark


> SMB joins produce incorrect results with multiple partitions and buckets
> ------------------------------------------------------------------------
>
>                 Key: HIVE-5973
>                 URL: https://issues.apache.org/jira/browse/HIVE-5973
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.13.0
>            Reporter: Vikram Dixit K
>            Assignee: Vikram Dixit K
>             Fix For: 0.13.0
>
>
> It looks like there is an issue with re-using the output object array in the 
> select operator. When we read rows of the non-big tables, we hold on to the 
> output object in the priority queue. This causes hive to produce incorrect 
> results because all the elements in the priority queue refer to the same 
> object and the join happens on only one of the buckets.
> {noformat}
> output[i] = eval[i].evaluate(row);
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

[jira] [Commented] (HIVE-5973) SMB joins produce incorrect results with multiple partitions and buckets

Reply via email to