[ https://issues.apache.org/jira/browse/HIVE-24947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marton Bod updated HIVE-24947: ------------------------------ Description: We have two parquet tables (target and source). Upon running the query: {code:java} set hive.vectorized.execution.enabled=true; insert into target2 partition(part_col_1, part_col_2) select * from source;{code} The following exception is thrown: {code:java} Caused by: java.lang.ClassCastException: java.lang.Integer cannot be cast to [B at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.fillColumnVector(VectorizedListColumnReader.java:308) at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.convertValueListToListColumnVector(VectorizedListColumnReader.java:342) at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.readBatch(VectorizedListColumnReader.java:91) at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.nextBatch(VectorizedParquetRecordReader.java:433) at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:376) at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:99) at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:365) ... 24 more {code} The same runs without problems when vectorization is turned off. Attaching the show create table statements and the sample parquet file so it can reproduced. cc [~nareshpr] was: We have two parquet tables (target and source). Upon running the query: {code:java} set hive.vectorized.execution.enabled=true; insert into target2 partition(tlmtc_fl_gnrtd_yr_nb, tlmtc_fl_gnrtd_mnth_nb) select * from source;{code} The following exception is thrown: {code:java} Caused by: java.lang.ClassCastException: java.lang.Integer cannot be cast to [B at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.fillColumnVector(VectorizedListColumnReader.java:308) at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.convertValueListToListColumnVector(VectorizedListColumnReader.java:342) at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.readBatch(VectorizedListColumnReader.java:91) at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.nextBatch(VectorizedParquetRecordReader.java:433) at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:376) at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:99) at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:365) ... 24 more {code} The same runs without problems when vectorization is turned off. Attaching the show create table statements and the sample parquet file so it can reproduced. cc [~nareshpr] > Casting exception when reading vectorized parquet file for insert into > ---------------------------------------------------------------------- > > Key: HIVE-24947 > URL: https://issues.apache.org/jira/browse/HIVE-24947 > Project: Hive > Issue Type: Bug > Affects Versions: 4.0.0 > Reporter: Marton Bod > Priority: Major > > We have two parquet tables (target and source). > Upon running the query: > {code:java} > set hive.vectorized.execution.enabled=true; > insert into target2 partition(part_col_1, part_col_2) select * from > source;{code} > The following exception is thrown: > {code:java} > Caused by: java.lang.ClassCastException: java.lang.Integer cannot be cast to > [B > at > org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.fillColumnVector(VectorizedListColumnReader.java:308) > at > org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.convertValueListToListColumnVector(VectorizedListColumnReader.java:342) > at > org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.readBatch(VectorizedListColumnReader.java:91) > at > org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.nextBatch(VectorizedParquetRecordReader.java:433) > at > org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:376) > at > org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:99) > at > org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:365) > ... 24 more > {code} > The same runs without problems when vectorization is turned off. > Attaching the show create table statements and the sample parquet file so it > can reproduced. > cc [~nareshpr] -- This message was sent by Atlassian Jira (v8.3.4#803005)