[jira] [Created] (HIVE-14593) Non-canonical integer partition columns do not work with IN operations

Harsh J (JIRA) Fri, 19 Aug 2016 21:40:05 -0700

Harsh J created HIVE-14593:
------------------------------

             Summary: Non-canonical integer partition columns do not work with 
IN operations
                 Key: HIVE-14593
                 URL: https://issues.apache.org/jira/browse/HIVE-14593
             Project: Hive
          Issue Type: Bug
          Components: Metastore
    Affects Versions: 1.0.0
            Reporter: Harsh J



The below use-case no longer works (tested on a PostgresQL backed HMS using 
JDO):

{code}
CREATE TABLE foo (a STRING) PARTITIONED BY (b INT, c INT);
ALTER TABLE foo ADD PARTITION (b='07', c='08');
LOAD DATA LOCAL INPATH '/etc/hostname' INTO TABLE foo PARTITION(b='07', c='08');

-- Does not work if you provide a string IN variable:

SELECT a, c FROM foo WHERE b IN ('07');
(No rows selected)

-- Works if you provide it in integer forms:

SELECT a, c FROM foo WHERE b IN (07);
(1 row(s) selected)
SELECT a, c FROM foo WHERE b IN (7);
(1 row(s) selected)
{code}

This worked fine prior to HIVE-8099. The change of HIVE-8099 is inducing a 
double conversion on the partition column input, such that the IN GenericUDFIn 
now receives b's value as a column type converted canonical integer 7, as 
opposed to an as-is DB stored non-canonical value 07. Subsequently the 
GenericUDFIn again up-converts the b's value to match its argument's value 
types instead, making 7 (int) into a string "7". Then, "7" is compared against 
"07" which naturally never matches.

As a regression, this breaks anyone upgrading pre-1.0 to 1.0 or higher.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-14593) Non-canonical integer partition columns do not work with IN operations

Reply via email to