[ https://issues.apache.org/jira/browse/HIVE-21185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16813048#comment-16813048 ]
chengkun jia edited comment on HIVE-21185 at 4/9/19 6:52 AM: ------------------------------------------------------------- I think this issue is resolved in https://issues.apache.org/jira/browse/HIVE-18833 and released in hive 2.3.4 and 3.0.0 that's what i just wanted. was (Author: lfyzjck): I think this issue is resolved in https://issues.apache.org/jira/browse/HIVE-18833 that's what i just wanted. > insert overwrite directory ... stored as nontextfile raise exception with > merge files open > ------------------------------------------------------------------------------------------ > > Key: HIVE-21185 > URL: https://issues.apache.org/jira/browse/HIVE-21185 > Project: Hive > Issue Type: Bug > Components: Query Planning > Affects Versions: 2.1.1, 2.3.0 > Reporter: chengkun jia > Priority: Major > > reproduce: > > {code:java} > # init table with small files > create table multiple_small_files (id int); > insert into multiple_small_files values (1); > insert into multiple_small_files values (1); > insert into multiple_small_files values (1); > insert into multiple_small_files values (1); > insert into multiple_small_files values (1); > insert into multiple_small_files values (1); > insert into multiple_small_files values (1); > insert into multiple_small_files values (1); > # open small file merge > set hive.merge.mapfiles=true; > set hive.merge.mapredfiles=true; > insert overwrite directory '/path/to/hdfs' stored as avro > select * from multiple_small_files; > {code} > this will produce exception like: > {code:java} > Messages for this Task:Error: java.lang.RuntimeException: > org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while > processing writable > Objavro.schema�{"type":"record","name":"baseRecord","fields":[{"name":"_col0","type":["null","int"],"default":null}]}�$$����N���e(��� > �$$����N���e(��� > at org.apache.hadoop.hive.ql.exec.mr.ExecMapper.map(ExecMapper.java:169) at > org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54) at > org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:453) at > org.apache.hadoop.mapred.MapTask.run(MapTask.java:343) at > org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:163) at > java.security.AccessController.doPrivileged(Native Method) at > javax.security.auth.Subject.doAs(Subject.java:422) at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1671) > at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)Caused by: > org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while > processing writable > Objavro.schema�{"type":"record","name":"baseRecord","fields":[{"name":"_col0","type":["null","int"],"default":null}]}�$$����N���e(��� > �$$����N���e(��� at > org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:497) at > org.apache.hadoop.hive.ql.exec.mr.ExecMapper.map(ExecMapper.java:160) ... 8 > moreCaused by: org.apache.hadoop.hive.serde2.avro.AvroSerdeException: > Expecting a AvroGenericRecordWritable at > org.apache.hadoop.hive.serde2.avro.AvroDeserializer.deserialize(AvroDeserializer.java:139) > at > org.apache.hadoop.hive.serde2.avro.AvroSerDe.deserialize(AvroSerDe.java:216) > at > org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.readRow(MapOperator.java:128) > at > org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.access$200(MapOperator.java:92) > at org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:488) > ... 9 moreFAILED: Execution Error, return code 2 from > org.apache.hadoop.hive.ql.exec.mr.MapRedTask > {code} > > This issue not only affect avrofile format but all nontextfile storage > format. The rootcause is hive get wrong input format in file merge stage -- This message was sent by Atlassian JIRA (v7.6.3#76005)