Cheng Lian created HIVE-14294:
---------------------------------

             Summary: HiveSchemaConverter for Parquet doesn't translate TINYINT 
and SMALLINT into proper Parquet types
                 Key: HIVE-14294
                 URL: https://issues.apache.org/jira/browse/HIVE-14294
             Project: Hive
          Issue Type: Bug
    Affects Versions: 2.1.0, 1.2.1
            Reporter: Cheng Lian


To reproduce this issue, run the following DDL:

{code:sql}
CREATE TABLE foo STORED AS PARQUET AS SELECT CAST(1 AS TINYINT);
{code}

And then check the schema of the written Parquet file:

{noformat}
$ parquet-schema $WAREHOUSE_PATH/foo/000000_0
message hive_schema {
  optional int32 _c0;
}
{noformat}

When translating Hive types into Parquet types, {{TINYINT}} and {{SMALLINT}} 
should be translated into the {{int32 (INT_8)}} and {{int32 (INT_16)}} 
respectively. However, {{HiveSchemaConverter}} converts all of {{TINYINT}}, 
{{SMALLINT}}, and {{INT}} into Parquet {{int32}}. This causes problem when 
accessing Parquet files generated by Hive in other systems since type 
information gets wrong.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to