[ https://issues.apache.org/jira/browse/HIVE-14294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gabor Szadovszky updated HIVE-14294: ------------------------------------ Attachment: HIVE-14294.patch > HiveSchemaConverter for Parquet doesn't translate TINYINT and SMALLINT into > proper Parquet types > ------------------------------------------------------------------------------------------------ > > Key: HIVE-14294 > URL: https://issues.apache.org/jira/browse/HIVE-14294 > Project: Hive > Issue Type: Bug > Affects Versions: 1.2.1, 2.1.0 > Reporter: Cheng Lian > Assignee: Gabor Szadovszky > Attachments: HIVE-14294.patch > > > To reproduce this issue, run the following DDL: > {code:sql} > CREATE TABLE foo STORED AS PARQUET AS SELECT CAST(1 AS TINYINT); > {code} > And then check the schema of the written Parquet file: > {noformat} > $ parquet-schema $WAREHOUSE_PATH/foo/000000_0 > message hive_schema { > optional int32 _c0; > } > {noformat} > When translating Hive types into Parquet types, {{TINYINT}} and {{SMALLINT}} > should be translated into the {{int32 (INT_8)}} and {{int32 (INT_16)}} > respectively. However, {{HiveSchemaConverter}} converts all of {{TINYINT}}, > {{SMALLINT}}, and {{INT}} into Parquet {{int32}}. This causes problem when > accessing Parquet files generated by Hive in other systems since type > information gets wrong. -- This message was sent by Atlassian JIRA (v6.3.4#6332)