Sergey Shelukhin created HIVE-16051:
---------------------------------------

             Summary: MM tables: skewjoin test fails
                 Key: HIVE-16051
                 URL: https://issues.apache.org/jira/browse/HIVE-16051
             Project: Hive
          Issue Type: Sub-task
            Reporter: Sergey Shelukhin


{noformat}
set hive.optimize.skewjoin = true;
set hive.skewjoin.key = 2;
set hive.optimize.metadataonly=false;

CREATE TABLE dest_j1(key INT, value STRING) STORED AS TEXTFILE tblproperties 
("transactional"="true", "transactional_properties"="insert_only");

FROM src src1 JOIN src src2 ON (src1.key = src2.key)
INSERT OVERWRITE TABLE dest_j1 SELECT src1.key, src2.value;

select count(distinct key) from dest_j1;
{noformat}
The final query returns different results for the MM table and for a non-MM table.
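
A minimal sketch of the non-MM side of the comparison, assuming the control is 
the same table created without the transactional properties. The table name 
dest_j1_plain is hypothetical and is not taken from the actual q file:
{noformat}
-- Hypothetical non-MM control table: same schema, no transactional properties.
CREATE TABLE dest_j1_plain(key INT, value STRING) STORED AS TEXTFILE;

-- Same skewjoin insert as above, written to the control table.
FROM src src1 JOIN src src2 ON (src1.key = src2.key)
INSERT OVERWRITE TABLE dest_j1_plain SELECT src1.key, src2.value;

-- Compare total rows and distinct keys between the two tables.
select count(*), count(distinct key) from dest_j1;
select count(*), count(distinct key) from dest_j1_plain;
{noformat}
Comparing both counts may help tell missing rows apart from duplicated ones.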

This probably has something to do with how skewjoin handles files. However, the 
MM debugging logs show no suspicious deletes, and everything looks the same in 
both cases: all the logging for skewjoin row containers and related state is 
identical between the two runs, apart from the numbers/GUIDs (the number of 
files, the paths, etc. are all the same). So I'm not sure what's going on. A DFS 
dump could probably answer this question, but it doesn't currently work for me 
on q files.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
