[ https://issues.apache.org/jira/browse/HIVE-7142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14096722#comment-14096722 ]
Chengxiang Li commented on HIVE-7142: ------------------------------------- Thanks, [~leftylev], I would like to document it in wiki, just i don't know how to do it yet. I've read [https://cwiki.apache.org/confluence/display/Hive/How+to+edit+the+website], should I create another JIRA and upload a wiki patch based on [https://svn.apache.org/repos/asf/hive/cms/trunk]? > Hive multi serialization encoding support > ----------------------------------------- > > Key: HIVE-7142 > URL: https://issues.apache.org/jira/browse/HIVE-7142 > Project: Hive > Issue Type: Improvement > Components: Serializers/Deserializers > Reporter: Chengxiang Li > Assignee: Chengxiang Li > Labels: TODOC14 > Fix For: 0.14.0 > > Attachments: HIVE-7142.1.patch.txt, HIVE-7142.2.patch, > HIVE-7142.3.patch, HIVE-7142.4.patch > > > Currently Hive only support serialize data into UTF-8 charset bytes or > deserialize from UTF-8 bytes, real world users may want to load different > kinds of encoded data into hive directly. This jira is dedicated to support > serialize/deserialize all kinds of encoded data in SerDe layer. > For user, only need to configure serialization encoding on table level by set > serialization encoding through serde parameter, for example: > {code:sql} > CREATE TABLE person(id INT, name STRING, desc STRING)ROW FORMAT SERDE > 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH > SERDEPROPERTIES("serialization.encoding"='GBK'); > {code} > or > {code:sql} > ALTER TABLE person SET SERDEPROPERTIES ('serialization.encoding'='GBK'); > {code} > LIMITATIONS: Only LazySimpleSerDe support "serialization.encoding" property > in this patch. -- This message was sent by Atlassian JIRA (v6.2#6252)