[ 
https://issues.apache.org/jira/browse/HIVE-2238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13701600#comment-13701600
 ] 

Edward Capriolo commented on HIVE-2238:
---------------------------------------

Hey all,
let's regroup and get this added.

A few comments.
1) Fix your formatting. You have to use the Apache style (we have an Eclipse
template on the wiki somewhere).
2) The logic in getEvaluator: it would be better to have the if statement check
for a positive list of types it accepts rather than a negative (incomplete)
list of things it does not.
3) Instead of public ArrayList<DoubleWritable> mode() { use public
List<DoubleWritable> mode() { when possible.
4) LOAD DATA LOCAL INPATH '/home/691623/Desktop/input.txt' OVERWRITE: the data
must be loaded from the data directory, like other tests in our test framework.
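
Points 2 and 3 could look roughly like the sketch below. It is not taken from
the patch: ColumnType is a hypothetical stand-in enum rather than Hive's real
type classes, the choice of numeric types as the accepted set is an assumption,
and plain Double is used in place of DoubleWritable so the example compiles
without Hive on the classpath.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.EnumSet;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class ModeSketch {
  // Hypothetical stand-in for the argument types a resolver might be handed.
  public enum ColumnType { BYTE, SHORT, INT, LONG, FLOAT, DOUBLE, STRING, BOOLEAN }

  // Point 2: a positive list of accepted types (assumed here to be the
  // numeric ones). Anything not named is rejected, including types that are
  // added later, so the check cannot silently become incomplete.
  private static final Set<ColumnType> ACCEPTED = EnumSet.of(
      ColumnType.BYTE, ColumnType.SHORT, ColumnType.INT,
      ColumnType.LONG, ColumnType.FLOAT, ColumnType.DOUBLE);

  public static void checkType(ColumnType type) {
    if (!ACCEPTED.contains(type)) {
      throw new IllegalArgumentException(
          "Only numeric type arguments are accepted but " + type + " was passed.");
    }
  }

  // Point 3: the declared return type is the List interface; ArrayList stays
  // an implementation detail that callers never see.
  public static List<Double> mode(List<Double> values) {
    Map<Double, Integer> counts = new HashMap<>();
    int best = 0;
    for (Double v : values) {
      int c = counts.merge(v, 1, Integer::sum);
      best = Math.max(best, c);
    }
    List<Double> modes = new ArrayList<>();
    for (Map.Entry<Double, Integer> e : counts.entrySet()) {
      if (e.getValue() == best) {
        modes.add(e.getKey());
      }
    }
    Collections.sort(modes); // deterministic order when there are ties
    return modes;
  }

  public static void main(String[] args) {
    checkType(ColumnType.DOUBLE); // accepted, returns normally
    List<Double> data = new ArrayList<>();
    Collections.addAll(data, 1.0, 2.0, 2.0, 3.0, 3.0);
    System.out.println(mode(data));
  }
}
```

With the positive list, a type nobody thought about is rejected by default
instead of slipping through the check.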
                
> Support for Median and Mode UDAFs
> ---------------------------------
>
>                 Key: HIVE-2238
>                 URL: https://issues.apache.org/jira/browse/HIVE-2238
>             Project: Hive
>          Issue Type: New Feature
>          Components: UDF
>            Reporter: Travis Powell
>              Labels: patch
>         Attachments: HIVE-2238.1.patch.txt, input.txt, median.q, 
> median.q.out, mode.q, mode.q.out
>
>
> Median and Mode are essential functions for reducing/refining the data set, 
> and would allow for greater control over the selection of data. More involved 
> analytics are probably best handled by relational databases or OLAP cubes, 
> but Median and Mode are very practical for Hive solely in terms of delivering 
> a smaller data set, where items selected only have a certain mode. (Rows that 
> describe an object to which the table is joined where that object has a 
> column value frequency threshold.)
> Comments are more than welcome. Would be happy to support. 
