[
https://issues.apache.org/jira/browse/HIVE-3833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13539201#comment-13539201
]
Namit Jain commented on HIVE-3833:
----------------------------------
Consider the following test:
set hive.input.format = org.apache.hadoop.hive.ql.io.CombineHiveInputFormat;
create table partition_test_partitioned(key string, value string) partitioned
by (dt string) stored as rcfile;
alter table partition_test_partitioned set serde
'org.apache.hadoop.hive.serde2.columnar.LazyBinaryColumnarSerDe';
insert overwrite table partition_test_partitioned partition(dt='1') select *
from src where key = 238;
alter table partition_test_partitioned change key key int;
The query:
select * from partition_test_partitioned where dt is not null;
returns:
50 val_238 1
50 val_238 1
This is due to the fact that the key column was serialized as a string column,
and is now being read as a integer.
> object inspectors should be initialized based on partition metadata
> -------------------------------------------------------------------
>
> Key: HIVE-3833
> URL: https://issues.apache.org/jira/browse/HIVE-3833
> Project: Hive
> Issue Type: Improvement
> Components: Query Processor
> Reporter: Namit Jain
> Assignee: Namit Jain
>
> Currently, different partitions can be picked up for the same input split
> based on the
> serdes' etc. And, we dont allow to change the schema for
> LazyColumnarBinarySerDe.
> Instead of that, different partitions should be part of the same split, only
> if the
> partition schemas exactly match. The operator tree object inspectors should
> be based
> on the partition schema. That would give greater flexibility and also help
> using binary serde with rcfile
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira