Navis created HIVE-6551:
---------------------------

    Summary: group by after join with skew join optimization references invalid task sometimes
    Key: HIVE-6551
    URL: https://issues.apache.org/jira/browse/HIVE-6551
    Project: Hive
    Issue Type: Bug
    Reporter: Navis
    Assignee: Navis
    Priority: Trivial

For example,
{noformat}
hive> set hive.auto.convert.join = true;
hive> set hive.optimize.skewjoin = true;
hive> set hive.skewjoin.key = 3;
hive>
    > EXPLAIN FROM
    > (SELECT src.* FROM src) x
    > JOIN
    > (SELECT src.* FROM src) Y
    > ON (x.key = Y.key)
    > SELECT sum(hash(Y.key)), sum(hash(Y.value));
OK
STAGE DEPENDENCIES:
  Stage-8 is a root stage
  Stage-6 depends on stages: Stage-8
  Stage-5 depends on stages: Stage-6 , consists of Stage-4, Stage-2
  Stage-4
  Stage-2 depends on stages: Stage-4, Stage-1
  Stage-0 is a root stage
...
{noformat}

Stage-2 references Stage-1, which does not exist in the plan.
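
The dangling reference can be spotted mechanically in the stage dependency listing above. The following is a minimal sketch, not part of Hive: the function name `find_dangling_stages` is hypothetical, and it assumes that the first `Stage-N` token on each line is the stage being declared and that any later token on the same line is a reference to another stage.

```python
import re


def find_dangling_stages(plan_text):
    """Return stage names that are referenced but never declared.

    Assumption: in a STAGE DEPENDENCIES listing, the first Stage-N on a
    line is the declared stage; the remaining ones are references.
    """
    declared = set()
    referenced = set()
    for line in plan_text.splitlines():
        stages = re.findall(r"Stage-\d+", line)
        if not stages:
            continue  # skip headers and blank lines
        declared.add(stages[0])
        referenced.update(stages[1:])
    return sorted(referenced - declared)


# Stage dependency lines copied from the EXPLAIN output in this report.
PLAN = """\
Stage-8 is a root stage
Stage-6 depends on stages: Stage-8
Stage-5 depends on stages: Stage-6 , consists of Stage-4, Stage-2
Stage-4
Stage-2 depends on stages: Stage-4, Stage-1
Stage-0 is a root stage
"""

print(find_dangling_stages(PLAN))
```

Run against the listing in this report, the check flags Stage-1 as the only stage that is referenced without being declared.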