[ https://issues.apache.org/jira/browse/HIVE-11032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14602367#comment-14602367 ]
Rui Li commented on HIVE-11032:
-------------------------------

Hi [~mohitsabharwal], thanks for the work. For the newly generated golden files, have you verified that the query plan is in line with the MR version? Basically, we'll have an extra stage to do the group by, using {{rand()}} as the partitioner.

> Enable more tests for grouping by skewed data [Spark Branch]
> ------------------------------------------------------------
>
>                 Key: HIVE-11032
>                 URL: https://issues.apache.org/jira/browse/HIVE-11032
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Rui Li
>            Assignee: Mohit Sabharwal
>            Priority: Minor
>         Attachments: HIVE-11032.1-spark.patch, HIVE-11032.2-spark.patch
>
>
> Not all such tests are enabled, e.g. {{groupby1_map_skew.q}}. We can use
> this JIRA to track whether we need more of them.
> Basically, we need to look at all tests with
> {{set hive.groupby.skewindata=true;}}.
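
For reference, the two-stage shape asked about in the comment (an extra stage that partitions by {{rand()}} before the real group by) can be sketched as a small simulation. This is only an illustration in plain Python, not Hive or Spark code: the function name, {{num_reducers}}, and {{seed}} are hypothetical, and it assumes a simple count aggregate.

```python
import random
from collections import Counter


def two_stage_count(rows, num_reducers=4, seed=0):
    """Count rows per key with a skew-tolerant two-stage plan (simulation)."""
    rng = random.Random(seed)

    # Stage 1: send each row to a random reducer (stand-in for partitioning
    # by rand()), and pre-aggregate inside each reducer. A hot key is spread
    # across all reducers instead of landing on a single one.
    partials = [Counter() for _ in range(num_reducers)]
    for key in rows:
        partials[rng.randrange(num_reducers)][key] += 1

    # Stage 2: repartition the partial counts by the group-by key and merge.
    # Each reducer now sees at most num_reducers records per key.
    finals = [Counter() for _ in range(num_reducers)]
    for partial in partials:
        for key, count in partial.items():
            finals[hash(key) % num_reducers][key] += count

    result = {}
    for final in finals:
        result.update(final)

    # Also return the number of rows each stage-1 reducer had to process.
    stage1_loads = [sum(p.values()) for p in partials]
    return result, stage1_loads


if __name__ == "__main__":
    # Heavily skewed input: key "a" dominates.
    rows = ["a"] * 1000 + ["b"] * 10 + ["c"] * 5
    counts, loads = two_stage_count(rows)
    print(counts)
    print(loads)
```

The counts match a plain single-stage group by, but no stage-1 reducer has to process all rows of the hot key, which is the property a plan check against the MR version would be looking for.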