Chengxiang Li created HIVE-7780:
-----------------------------------
Summary: Query with OVER clause return duplicate results[Spark
Branch]
Key: HIVE-7780
URL: https://issues.apache.org/jira/browse/HIVE-7780
Project: Hive
Issue Type: Bug
Components: Spark
Reporter: Chengxiang Li
Assignee: Chengxiang Li
A simple query with the OVER clause return duplicate results.
{code:sql}
hive> select address, count(id) over(partition by address) from test;
Query ID = root_20140819150000_f5506fcc-4950-424b-a134-56fc5b06d6eb
Total jobs = 1
Launching Job 1 out of 1
Number of reduce tasks determined at compile time: 1
In order to change the average load for a reducer (in bytes):
set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
set mapreduce.job.reduces=<number>
OK
QD 1
SH 2
SH 2
SZ 2
SZ 2
{code}
--
This message was sent by Atlassian JIRA
(v6.2#6252)