[ https://issues.apache.org/jira/browse/HIVE-17261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16167454#comment-16167454 ]
Junjie Chen commented on HIVE-17261: ------------------------------------ I think the length in create table should specify the maximum length for column. Looks like hive does not write cast values to parquet. Following are parquet file dump, no tail spaces in the end. c = hello v = world d = ACvU da = 57 c = apple v = bee d = AADc da = 50 c = hello v = world d = ACvU da = 57 c = apple v = bee d = AADc da = 50 > Hive use deprecated ParquetInputSplit constructor which blocked parquet > dictionary filter > ----------------------------------------------------------------------------------------- > > Key: HIVE-17261 > URL: https://issues.apache.org/jira/browse/HIVE-17261 > Project: Hive > Issue Type: Improvement > Components: Database/Schema > Affects Versions: 2.2.0 > Reporter: Junjie Chen > Assignee: Junjie Chen > Fix For: 3.0.0 > > Attachments: HIVE-17261.10.patch, HIVE-17261.11.patch, > HIVE-17261.2.patch, HIVE-17261.3.patch, HIVE-17261.4.patch, > HIVE-17261.5.patch, HIVE-17261.6.patch, HIVE-17261.7.patch, > HIVE-17261.8.patch, HIVE-17261.diff, HIVE-17261.patch > > > Hive use deprecated ParquetInputSplit in > [https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/ParquetRecordReaderBase.java#L128] > Please see interface definition in > [https://github.com/apache/parquet-mr/blob/master/parquet-hadoop/src/main/java/org/apache/parquet/hadoop/ParquetInputSplit.java#L80] > Old interface set rowgroupoffset values which will lead to skip dictionary > filter in parquet. -- This message was sent by Atlassian JIRA (v6.4.14#64029)