[ https://issues.apache.org/jira/browse/HIVE-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Navis updated HIVE-9618:
------------------------
    Attachment: HIVE-9618.3.patch.txt

> Deduplicate RS keys for ptf/windowing
> -------------------------------------
>
>                 Key: HIVE-9618
>                 URL: https://issues.apache.org/jira/browse/HIVE-9618
>             Project: Hive
>          Issue Type: Improvement
>          Components: PTF-Windowing
>            Reporter: Navis
>            Assignee: Navis
>            Priority: Trivial
>         Attachments: HIVE-9618.1.patch.txt, HIVE-9618.2.patch.txt, HIVE-9618.3.patch.txt
>
>
> Currently, a partition spec that uses the same column in both partition-by and
> order-by produces a duplicated key column in the RS (ReduceSink) operator. For example,
> {noformat}
> explain
> select p_mfgr, p_name, p_size,
> rank() over (partition by p_mfgr order by p_name) as r,
> dense_rank() over (partition by p_mfgr order by p_name) as dr,
> sum(p_retailprice) over (partition by p_mfgr order by p_name rows between unbounded preceding and current row) as s1
> from noop(on noopwithmap(on noop(on part
> partition by p_mfgr
> order by p_mfgr, p_name
> )))
> {noformat}
> Here "partition by p_mfgr order by p_mfgr, p_name" produces duplicated key columns,
> as shown below:
> {noformat}
> Reduce Output Operator
>   key expressions: p_mfgr (type: string), p_mfgr (type: string), p_name (type: string)
>   sort order: +++
>   Map-reduce partition columns: p_mfgr (type: string)
>   value expressions: p_size (type: int), p_retailprice (type: double)
> {noformat}
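
The deduplication described in this issue can be illustrated with a minimal sketch. It is not taken from the attached patches and does not use Hive's planner classes; the class name RsKeyDedupSketch and the method dedupKeys are hypothetical, and columns are modeled as plain strings. The idea is that the key list is built as partition-by columns followed by order-by columns, and a column already present is not added again. Skipping a repeated partition column should not change the ordering inside a partition, since every row in a partition has the same value for that column.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; not Hive code and not the HIVE-9618 patch.
class RsKeyDedupSketch {

    /**
     * Builds the key column list as partition-by columns followed by
     * order-by columns, keeping only the first occurrence of each column.
     */
    static List<String> dedupKeys(List<String> partitionCols, List<String> orderCols) {
        // LinkedHashSet keeps insertion order and ignores repeated elements.
        Set<String> seen = new LinkedHashSet<>(partitionCols);
        seen.addAll(orderCols);
        return new ArrayList<>(seen);
    }

    public static void main(String[] args) {
        // Mirrors "partition by p_mfgr order by p_mfgr, p_name" from the example.
        List<String> keys = dedupKeys(List.of("p_mfgr"), List.of("p_mfgr", "p_name"));
        System.out.println(keys); // prints [p_mfgr, p_name]
    }
}
```

With deduplicated keys, the sort order string would presumably shrink to match (two entries instead of three in the example), but how the patch handles that is not stated in this message.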