[jira] [Updated] (HIVE-24947) Casting exception when reading vectorized parquet file for insert into

Marton Bod (Jira) Fri, 26 Mar 2021 07:50:04 -0700


     [ 
https://issues.apache.org/jira/browse/HIVE-24947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Marton Bod updated HIVE-24947:
------------------------------
    Description: 
We have two parquet tables (target and source).

Upon running the query:
{code:java}
set hive.vectorized.execution.enabled=true;
insert into target2 partition(part_col_1, part_col_2) select * from 
source;{code}
The following exception is thrown:
{code:java}
Caused by: java.lang.ClassCastException: java.lang.Integer cannot be cast to [B
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.fillColumnVector(VectorizedListColumnReader.java:308)
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.convertValueListToListColumnVector(VectorizedListColumnReader.java:342)
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.readBatch(VectorizedListColumnReader.java:91)
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.nextBatch(VectorizedParquetRecordReader.java:433)
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:376)
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:99)
        at 
org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:365)
        ... 24 more
{code}
The same runs without problems when vectorization is turned off. 

Attaching the show create table statements and the sample parquet file so it 
can reproduced.

cc [~nareshpr]

  was:
We have two parquet tables (target and source).

Upon running the query:
{code:java}
set hive.vectorized.execution.enabled=true;
insert into target2 partition(tlmtc_fl_gnrtd_yr_nb, tlmtc_fl_gnrtd_mnth_nb) 
select * from source;{code}
The following exception is thrown:
{code:java}
Caused by: java.lang.ClassCastException: java.lang.Integer cannot be cast to [B
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.fillColumnVector(VectorizedListColumnReader.java:308)
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.convertValueListToListColumnVector(VectorizedListColumnReader.java:342)
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.readBatch(VectorizedListColumnReader.java:91)
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.nextBatch(VectorizedParquetRecordReader.java:433)
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:376)
        at 
org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:99)
        at 
org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:365)
        ... 24 more
{code}
The same runs without problems when vectorization is turned off. 

Attaching the show create table statements and the sample parquet file so it 
can reproduced.

cc [~nareshpr]


> Casting exception when reading vectorized parquet file for insert into
> ----------------------------------------------------------------------
>
>                 Key: HIVE-24947
>                 URL: https://issues.apache.org/jira/browse/HIVE-24947
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 4.0.0
>            Reporter: Marton Bod
>            Priority: Major
>
> We have two parquet tables (target and source).
> Upon running the query:
> {code:java}
> set hive.vectorized.execution.enabled=true;
> insert into target2 partition(part_col_1, part_col_2) select * from 
> source;{code}
> The following exception is thrown:
> {code:java}
> Caused by: java.lang.ClassCastException: java.lang.Integer cannot be cast to 
> [B
>       at 
> org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.fillColumnVector(VectorizedListColumnReader.java:308)
>       at 
> org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.convertValueListToListColumnVector(VectorizedListColumnReader.java:342)
>       at 
> org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedListColumnReader.readBatch(VectorizedListColumnReader.java:91)
>       at 
> org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.nextBatch(VectorizedParquetRecordReader.java:433)
>       at 
> org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:376)
>       at 
> org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.next(VectorizedParquetRecordReader.java:99)
>       at 
> org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:365)
>       ... 24 more
> {code}
> The same runs without problems when vectorization is turned off. 
> Attaching the show create table statements and the sample parquet file so it 
> can reproduced.
> cc [~nareshpr]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Updated] (HIVE-24947) Casting exception when reading vectorized parquet file for insert into

Reply via email to