[ https://issues.apache.org/jira/browse/HIVE-13632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15263146#comment-15263146 ]
Sergio Peña commented on HIVE-13632: ------------------------------------ [~ychena] I think we should try to be consistent with other file formats before committing this patch. Could you investigate what other format serde or object inspectors do in order to translate this to empty lists? > Hive failing on insert empty array into parquet table > ----------------------------------------------------- > > Key: HIVE-13632 > URL: https://issues.apache.org/jira/browse/HIVE-13632 > Project: Hive > Issue Type: Bug > Components: Serializers/Deserializers > Affects Versions: 1.1.0 > Reporter: Yongzhi Chen > Assignee: Yongzhi Chen > Attachments: HIVE-13632.1.patch > > > The insert will fail with following stack: > {noformat} > by: parquet.io.ParquetEncodingException: empty fields are illegal, the field > should be ommited completely instead > at > parquet.io.MessageColumnIO$MessageColumnIORecordConsumer.endField(MessageColumnIO.java:271) > at > org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriter$ListDataWriter.write(DataWritableWriter.java:271) > at > org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriter$GroupDataWriter.write(DataWritableWriter.java:199) > at > org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriter$MessageDataWriter.write(DataWritableWriter.java:215) > at > org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriter.write(DataWritableWriter.java:88) > at > org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriteSupport.write(DataWritableWriteSupport.java:59) > at > org.apache.hadoop.hive.ql.io.parquet.write.DataWritableWriteSupport.write(DataWritableWriteSupport.java:31) > at > parquet.hadoop.InternalParquetRecordWriter.write(InternalParquetRecordWriter.java:116) > at > parquet.hadoop.ParquetRecordWriter.write(ParquetRecordWriter.java:123) > at parquet.hadoop.ParquetRecordWriter.write(ParquetRecordWriter.java:42) > at > org.apache.hadoop.hive.ql.io.parquet.write.ParquetRecordWriterWrapper.write(ParquetRecordWriterWrapper.java:111) > at > org.apache.hadoop.hive.ql.io.parquet.write.ParquetRecordWriterWrapper.write(ParquetRecordWriterWrapper.java:124) > at > org.apache.hadoop.hive.ql.exec.FileSinkOperator.processOp(FileSinkOperator.java:697) > {noformat} > Reproduce: > {noformat} > create table test_small ( > key string, > arrayValues array<string>) > stored as parquet; > insert into table test_small select 'abcd', array() from src limit 1; > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)