Gopal V created HIVE-7151: ----------------------------- Summary: NULL keys should not be shuffled for inner equi joins Key: HIVE-7151 URL: https://issues.apache.org/jira/browse/HIVE-7151 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.14.0 Reporter: Gopal V
There is a huge skew in reducer load for NULL keys in shuffle joins, resulting in one slow reducer out of many. The NULL keys will not contribute towards the inner JOIN condition for equality (unless using null-safes). -- This message was sent by Atlassian JIRA (v6.2#6252)