[ https://issues.apache.org/jira/browse/HIVE-6783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13958229#comment-13958229 ]
Xuefu Zhang commented on HIVE-6783: ----------------------------------- Patch committed to trunk. Thanks to Tongjie for the contribution. BTW, it seems that you're not in the contributor list so I'm not able to assign this JIRA to your credit. You might request this on dev email list. > Incompatible schema for maps between parquet-hive and parquet-pig > ----------------------------------------------------------------- > > Key: HIVE-6783 > URL: https://issues.apache.org/jira/browse/HIVE-6783 > Project: Hive > Issue Type: Bug > Components: File Formats > Affects Versions: 0.13.0 > Reporter: Tongjie Chen > Fix For: 0.14.0 > > Attachments: HIVE-6783.1.patch.txt, HIVE-6783.2.patch.txt, > HIVE-6783.3.patch.txt, HIVE-6783.4.patch.txt > > > see also in following parquet issue: > https://github.com/Parquet/parquet-mr/issues/290 > The schema written for maps isn't compatible between hive and pig. This means > any files written in one cannot be properly read in the other. > More specifically, for the same map column c1, parquet-pig generates schema: > message pig_schema { > optional group c1 (MAP) { > repeated group map (MAP_KEY_VALUE) { > required binary key (UTF8); > optional binary value; > } > } > } > while parquet-hive generates schema: > message hive_schema { > optional group c1 (MAP_KEY_VALUE) { > repeated group map { > required binary key; > optional binary value; > } > } > } -- This message was sent by Atlassian JIRA (v6.2#6252)