[ https://issues.apache.org/jira/browse/HIVE-7555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
J. Tipan Verella resolved HIVE-7555. ------------------------------------ Resolution: Invalid My apologies, This was in fact an issue with the query, as documented on the stackoverflow.com page. > inner join is being resolves as cartesian product > ------------------------------------------------- > > Key: HIVE-7555 > URL: https://issues.apache.org/jira/browse/HIVE-7555 > Project: Hive > Issue Type: Bug > Environment: CentOS > Reporter: J. Tipan Verella > > I believe this is a bug, because I do not seem to be able to find a way > around the following stackoverflow question, > http://stackoverflow.com/questions/25020190/hive-query-returns-cartesian-product-instead-of-inner-join > The issue is as follows (repeated from SO for convenience). > This is type of query I am sending to HIVE: > SELECT BigTable.nicefield,LargeTable.* > FROM LargeTable INNER JOIN BigTable > ON ( > LargeTable.joinfield1of4 = BigTable.joinfield1of4 > AND LargeTable.joinfield2of4 = BigTable.joinfield2of4 > ) > WHERE LargeTable.joinfield3of4=20140726 AND LargeTable.joinfield4of4=15 > AND BigTable.joinfield3of4=20140726 AND BigTable.joinfield4of4=15 > AND LargeTable.filterfiled1of2=123456 > AND LargeTable.filterfiled2of2=98765 > AND LargeTable.joinfield2of4=12 > AND LargeTable.joinfield1of4='iwanttolikehive' > It returns `2418025` rows. The issue is that > SELECT * > FROM LargeTable > WHERE joinfield3of4=20140726 AND joinfield4of4=15 > AND filterfiled1of2=123456 > AND filterfiled2of2=98765 > AND joinfield2of4=12 > AND joinfield1of4='iwanttolikehive' > returns `1555` rows, and so does: > SELECT * > FROM BigTable > WHERE joinfield3of4=20140726 AND joinfield4of4=15 > AND joinfield2of4=12 > AND joinfield1of4='iwanttolikehive' > Note that **1555^2 = 2418025**. > Feel free to discard this issue if it is not a bug, but please provide a > solution on SO. > Thank you. -- This message was sent by Atlassian JIRA (v6.2#6252)