[ https://issues.apache.org/jira/browse/HIVE-8205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14145548#comment-14145548 ]
Xuefu Zhang commented on HIVE-8205: ----------------------------------- +1 > Using strings in group type fails in ParquetSerDe > ------------------------------------------------- > > Key: HIVE-8205 > URL: https://issues.apache.org/jira/browse/HIVE-8205 > Project: Hive > Issue Type: Bug > Components: Serializers/Deserializers > Reporter: Mohit Sabharwal > Assignee: Mohit Sabharwal > Labels: parquet > Attachments: HIVE-8205.1.patch, HIVE-8205.1.patch, HIVE-8205.patch > > > In HIVE-7735, schema info was plumbed to ETypeConverter to disambiguate > between hive Char, Varchar and String types, which are all represented as > PrimitiveType "binary" and OriginalType "utf8" in parquet. > However, this does not work for parquet nested types (that map to hive Array, > Map, etc.) containing these values, because schema lookup for nested values > was not implemented. It's also non-trivial to do that in the current parquet > serde implementation. Instead of plumbing in the schema, we should convert > these types to the same Text writeable and let the object inspectors handle > the final conversion. -- This message was sent by Atlassian JIRA (v6.3.4#6332)