[
https://issues.apache.org/jira/browse/HIVE-4053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587827#comment-13587827
]
Krishna commented on HIVE-4053:
-------------------------------
The JIRA description lists 6 popular phonetic algorithms, and I think it is a
good idea to implement all of them in Hive. There are 2 ways to do this:
Option 1: Write a separate GenericUDF for each algorithm, so that each
algorithm is exposed as its own Hive function.
Option 2: Write a single GenericUDF that takes an extra argument selecting
which algorithm to apply.
I prefer option 2, but if anyone feels option 1 is better, please comment.
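As a rough illustration of option 2, the sketch below shows the dispatch idea
in plain Java, without any Hive classes. It is not taken from the attached
patch: the class name PhoneticDispatch, the method phonetic, and the algorithm
key "soundex" are all hypothetical, and only classic American Soundex is
filled in. In an actual GenericUDF the same lookup would presumably live in
evaluate(), with the argument types checked in initialize().

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.function.Function;

// Hypothetical sketch of option 2: one entry point, algorithm chosen by name.
class PhoneticDispatch {
    // Soundex digit for each letter A..Z; '0' marks letters that are not coded.
    private static final String CODES = "01230120022455012623010202";

    // Registry of algorithm name -> implementation (names are assumptions).
    private static final Map<String, Function<String, String>> ALGORITHMS =
            new HashMap<>();
    static {
        ALGORITHMS.put("soundex", PhoneticDispatch::soundex);
        // Other algorithms (metaphone, nysiis, ...) would be registered here.
    }

    // Classic American Soundex: first letter plus three digits, zero-padded.
    static String soundex(String s) {
        String in = s.toUpperCase(Locale.ROOT);
        StringBuilder out = new StringBuilder();
        char last = '0';
        for (int i = 0; i < in.length() && out.length() < 4; i++) {
            char c = in.charAt(i);
            if (c < 'A' || c > 'Z') {
                continue; // ignore anything that is not an ASCII letter
            }
            char code = CODES.charAt(c - 'A');
            if (out.length() == 0) {
                out.append(c);   // keep the first letter as-is
                last = code;
            } else if (c == 'H' || c == 'W') {
                continue;        // H and W do not separate equal codes
            } else {
                if (code != '0' && code != last) {
                    out.append(code);
                }
                last = code;     // vowels reset 'last', so repeats are kept
            }
        }
        if (out.length() == 0) {
            return "";
        }
        while (out.length() < 4) {
            out.append('0');
        }
        return out.toString();
    }

    // Single entry point: the first argument selects the algorithm.
    static String phonetic(String algorithm, String input) {
        if (algorithm == null || input == null) {
            return null; // mirror the usual SQL convention of NULL in, NULL out
        }
        Function<String, String> f =
                ALGORITHMS.get(algorithm.toLowerCase(Locale.ROOT));
        if (f == null) {
            throw new IllegalArgumentException(
                    "Unknown phonetic algorithm: " + algorithm);
        }
        return f.apply(input);
    }
}
```

With this shape, adding an algorithm means registering one more map entry
rather than one more Hive function, which is the main appeal of option 2. The
trade-off is that a misspelled algorithm name is only caught at run time.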
> Add support for phonetic algorithms in Hive
> -------------------------------------------
>
> Key: HIVE-4053
> URL: https://issues.apache.org/jira/browse/HIVE-4053
> Project: Hive
> Issue Type: New Feature
> Components: UDF
> Affects Versions: 0.10.0
> Reporter: Krishna
> Labels: patch
> Fix For: 0.10.0
>
> Attachments: FunctionRegistry.java, GenericUDFRefinedSoundex.java,
> HIVE-4053.1.patch.txt
>
>
> The following phonetic algorithms, which are very useful in search, should
> be considered:
> Soundex: http://en.wikipedia.org/wiki/Soundex
> Refined Soundex: Refer to the comment on 22/Feb/13 23:51
> Daitch–Mokotoff Soundex:
> http://en.wikipedia.org/wiki/Daitch%E2%80%93Mokotoff_Soundex
> Metaphone and Double Metaphone: http://en.wikipedia.org/wiki/Metaphone
> New York State Identification and Intelligence System (NYSIIS):
> http://en.wikipedia.org/wiki/New_York_State_Identification_and_Intelligence_System
> Caverphone: http://en.wikipedia.org/wiki/Caverphone