[ 
https://issues.apache.org/jira/browse/HIVE-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13547347#comment-13547347
 ] 

Alan Gates commented on HIVE-896:
---------------------------------

Harish,

Could you point out the interfaces (in the API sense, not the Java sense) that 
are most important in this patch?  In particular I'm intersted in interfaces 
between UDFs and Hive.  Based on my review so far the classes that stand out as 
important in this regard are TableFunctionEvaluator, TableFunctionResolver, and 
PTFPartition.  Are there others I should be looking at?

Questions I have so far:
* If I read this right you are using CLUSTER BY and SORT BY instead of 
PARTITION BY and ORDER BY for syntax in OVER.  Why?
* Does it ever make sense for a windowing function to return a partition?  
Should there be a interface/abstract class specific for windowing functions 
that only returns a single entry?
* Can I put one of the existing aggregate functions in an OVER clause using 
this?
* Could you explain how the partition is handled in memory?  It looks to me as 
if the entire partition is read into memory.  Is that correct?  If so, does it 
read it aggresively or as the iterator moves through the records?  It also 
appears there is no effort to drop earlier parts of the partition that are now 
out of range of the window.  Is that also correct?

                
> Add LEAD/LAG/FIRST/LAST analytical windowing functions to Hive.
> ---------------------------------------------------------------
>
>                 Key: HIVE-896
>                 URL: https://issues.apache.org/jira/browse/HIVE-896
>             Project: Hive
>          Issue Type: New Feature
>          Components: OLAP, UDF
>            Reporter: Amr Awadallah
>            Priority: Minor
>         Attachments: HIVE-896.1.patch.txt
>
>
> Windowing functions are very useful for click stream processing and similar 
> time-series/sliding-window analytics.
> More details at:
> http://download-west.oracle.com/docs/cd/B13789_01/server.101/b10736/analysis.htm#i1006709
> http://download-west.oracle.com/docs/cd/B13789_01/server.101/b10736/analysis.htm#i1007059
> http://download-west.oracle.com/docs/cd/B13789_01/server.101/b10736/analysis.htm#i1007032
> -- amr

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to