[ 
https://issues.apache.org/jira/browse/HIVE-7142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14096642#comment-14096642
 ] 

Lefty Leverenz commented on HIVE-7142:
--------------------------------------

This should be documented in wiki DDL sections "Row Format, Storage Format, and 
SerDe" and "Add SerDe Properties" (although ideally the SerDe doc would have a 
section with details about SERDEPROPERTIES):

* [DDL -- Row Format, Storage Format, and SerDe | 
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-RowFormat,StorageFormat,andSerDe]
* [DDL -- Add SerDe Properties | 
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-AddSerDeProperties]
* [SerDe | https://cwiki.apache.org/confluence/display/Hive/SerDe]

The jira's description could easily be turned into a release note.

> Hive multi serialization encoding support
> -----------------------------------------
>
>                 Key: HIVE-7142
>                 URL: https://issues.apache.org/jira/browse/HIVE-7142
>             Project: Hive
>          Issue Type: Improvement
>          Components: Serializers/Deserializers
>            Reporter: Chengxiang Li
>            Assignee: Chengxiang Li
>              Labels: TODOC14
>             Fix For: 0.14.0
>
>         Attachments: HIVE-7142.1.patch.txt, HIVE-7142.2.patch, 
> HIVE-7142.3.patch, HIVE-7142.4.patch
>
>
> Currently Hive only support serialize data into UTF-8 charset bytes or 
> deserialize from UTF-8 bytes, real world users may want to load different 
> kinds of encoded data into hive directly. This jira is dedicated to support 
> serialize/deserialize all kinds of encoded data in SerDe layer. 
> For user, only need to configure serialization encoding on table level by set 
> serialization encoding through serde parameter, for example:
> {code:sql}
> CREATE TABLE person(id INT, name STRING, desc STRING)ROW FORMAT SERDE 
> 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH 
> SERDEPROPERTIES("serialization.encoding"='GBK');
> {code}
> or
> {code:sql}
> ALTER TABLE person SET SERDEPROPERTIES ('serialization.encoding'='GBK'); 
> {code}
> LIMITATIONS: Only LazySimpleSerDe support "serialization.encoding" property 
> in this patch.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to