[ https://issues.apache.org/jira/browse/HIVE-5871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14124301#comment-14124301 ]
Lefty Leverenz commented on HIVE-5871:
--------------------------------------

Doc note: MultiDelimitSerDe needs to be documented in the wiki (with version information and a link to this JIRA ticket). It belongs in some existing docs, and a new doc with limitations and usage examples could be a child page to the SerDe doc (or a new section in the SerDe doc):

* [SerDe -- Built-in, Third-Party, and Custom SerDes | https://cwiki.apache.org/confluence/display/Hive/SerDe#SerDe-Built-in,Third-Party,andCustomSerDes]
* [DDL -- Create Table -- Row Format, Storage Format, and SerDe | https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-RowFormat,StorageFormat,andSerDe]
* possibly [DDL -- Add SerDe Properties | https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-AddSerDeProperties]
* [Developer Guide -- Hive SerDe (add to "Also:" list at end of section) | https://cwiki.apache.org/confluence/display/Hive/DeveloperGuide#DeveloperGuide-HiveSerDe]
* optionally [HCatalog Storage Formats -- SerDes and Storage Formats (first paragraph) | https://cwiki.apache.org/confluence/display/Hive/HCatalog+StorageFormats#HCatalogStorageFormats-SerDesandStorageFormats]

A release note could include the example from [~lirui]'s first comment.

> Use multiple-characters as field delimiter
> ------------------------------------------
>
>                  Key: HIVE-5871
>                  URL: https://issues.apache.org/jira/browse/HIVE-5871
>              Project: Hive
>           Issue Type: Improvement
>           Components: Contrib
>     Affects Versions: 0.12.0
>             Reporter: Rui Li
>             Assignee: Rui Li
>               Labels: TODOC14
>              Fix For: 0.14.0
>
>          Attachments: HIVE-5871.2.patch, HIVE-5871.3.patch, HIVE-5871.4.patch, HIVE-5871.5.patch, HIVE-5871.6.patch, HIVE-5871.patch
>
>
> By default, Hive only allows users to specify a single character as the field
> delimiter. Although RegexSerDe can handle a multiple-character delimiter, it
> can be daunting to use, especially for amateurs.
> In the patch, I add a new SerDe named MultiDelimitSerDe. With
> MultiDelimitSerDe, users can specify a multiple-character field delimiter
> when creating tables, in much the same way as in a typical table creation.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
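For readers unfamiliar with the problem the issue describes, here is a minimal Python sketch of why a multiple-character field delimiter matters. It is not Hive code and not taken from the patch: the helper `split_fields`, the sample row, and the delimiter `[,]` are all hypothetical and only illustrate the splitting behavior such a SerDe would need to provide.

```python
from typing import List


def split_fields(row: str, delimiter: str) -> List[str]:
    """Split one text row on a delimiter of any length (hypothetical helper)."""
    if not delimiter:
        raise ValueError("delimiter must be non-empty")
    return row.split(delimiter)


# Hypothetical row: three fields separated by the three-character string "[,]".
# The second field contains a plain comma as ordinary data.
row = "1[,]a,b[,]x"

# Splitting on the full multi-character delimiter keeps "a,b" as one field.
multi = split_fields(row, "[,]")

# Splitting on a single character cuts the row in the wrong places.
single = split_fields(row, ",")

print(multi)
print(single)
```

With the full delimiter, the row yields three fields and the embedded comma survives. With the single-character delimiter, the same row is broken into four fragments that no longer match the intended columns.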