[ https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
GuangMing Lu updated HIVE-25269: -------------------------------- Attachment: P10IDS_RISKLIST.zip p10ids_riskcon.zip p10ids_realpayrc_ygz.zip p10ids_prerec_split_ygz.zip comb_classcode.zip > When the skew and parallel parameters are true simultaneously, the result is > less data > -------------------------------------------------------------------------------------- > > Key: HIVE-25269 > URL: https://issues.apache.org/jira/browse/HIVE-25269 > Project: Hive > Issue Type: Bug > Components: Physical Optimizer, SQL > Affects Versions: 3.1.0, 3.1.2 > Reporter: GuangMing Lu > Priority: Major > Attachments: P10IDS_RISKLIST.zip, comb_classcode.zip, > p10ids_prerec_split_ygz.zip, p10ids_realpayrc_ygz.zip, p10ids_riskcon.zip, > test.sql > > > When the params of hive.optimize.skewjoin, hive.groupby.skewindata and > hive.exec.parallel are true, and exec sql such as 'INSERT... FROM (SUBQUERY > UNIONALL ...GROUP BY...) A JOIN/LEFT JOIN A.expression', result data will be > reduced. Details of SQL and test data can be found in the attachment -- This message was sent by Atlassian Jira (v8.3.4#803005)