marymwu created HIVE-14160:
------------------------------
Summary: Reduce-task costs a long time to finish on the condition
that the certain sql "select a,distinct(b) group by a" has been executed on the
data which has skew distribution
Key: HIVE-14160
URL: https://issues.apache.org/jira/browse/HIVE-14160
Project: Hive
Issue Type: Improvement
Components: hpl/sql
Affects Versions: 1.1.0
Reporter: marymwu
Reduce-task costs a long time to finish on the condition that the certain sql
"select a,distinct(b) group by a" has been executed on the data which has skew
distribution
data scale: 64G
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)