Shuaishuai Nie created HIVE-5023:
------------------------------------
Summary: Hive get wrong result when partition has the same path
but different schema or authority
Key: HIVE-5023
URL: https://issues.apache.org/jira/browse/HIVE-5023
Project: Hive
Issue Type: Bug
Reporter: Shuaishuai Nie
Hive does not differentiate scheme and authority in file uris which cause wrong
result when partition has the same path but different schema or authority. Here
is a simple repro
partition file path:
asv://[email protected]/2013-08-05/00/text1.txt
with content "2013-08-05 00:00:00"
asv://[email protected]/2013-08-05/00/text2.txt
with content "2013-08-05 00:00:20"
{noformat}
CREATE EXTERNAL TABLE IF NOT EXISTS T1 (t STRING) PARTITIONED BY (ProcessDate
STRING, Hour STRING, ClusterName STRING) ROW FORMAT DELIMITED FIELDS TERMINATED
by '\t' STORED AS TEXTFILE;
ALTER TABLE T1 DROP IF EXISTS PARTITION(processDate='2013-08-05', Hour='00',
clusterName ='CLusterA');
ALTER TABLE T1 ADD IF NOT EXISTS PARTITION(processDate='2013-08-05', Hour='00',
clusterName ='ClusterA') LOCATION
'asv://[email protected]/2013-08-05/00';
ALTER TABLE T1 DROP IF EXISTS PARTITION(processDate='2013-08-05', Hour='00',
clusterName ='ClusterB');
ALTER TABLE T1 ADD IF NOT EXISTS PARTITION(processDate='2013-08-05', Hour='00',
clusterName ='ClusterB') LOCATION
'asv://[email protected]/2013-08-05/00';
{noformat}
the expect output of the hive query
{noformat}
SELECT ClusterName, t FROM T1 WHERE ProcessDate=’2013-08-05’ AND Hour=’00’;
{noformat}
should be
{noformat}
ClusterA 2013-08-05 00:00:00
ClusterB 2013-08-05 00:00:20
{noformat}
However it is
{noformat}
ClusterA 2013-08-05 00:00:00
ClusterA 2013-08-05 00:00:20
{noformat}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira