Rui Li created HIVE-19671:
-----------------------------

             Summary: Distribute by rand() can lead to data inconsistency
                 Key: HIVE-19671
                 URL: https://issues.apache.org/jira/browse/HIVE-19671
             Project: Hive
          Issue Type: Bug
            Reporter: Rui Li


Noticed the following queries can give different results:
{code}
select count(*) from tbl;
select count(*) from (select * from tbl distribute by rand());
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to