[ 
https://issues.apache.org/jira/browse/HIVE-5761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13840374#comment-13840374
 ] 

Eric Hanson commented on HIVE-5761:
-----------------------------------

This sounds great. :-)

For VectorExpressions for YEAR/MONTH/DAY etc. on this, it will be important to 
verify correctness for all possible dates. Since the number of dates is small 
enough, you could probably do that with a brute force approach in a unit test 
and compare the result to the result obtained with the current Hive date logic.
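
A minimal sketch of what such a brute-force check could look like, in Java. 
Everything in it is an assumption made for illustration: the class and method 
names are hypothetical, DATE values are assumed to be held as days since 
1970-01-01, the 0001-01-01 to 9999-12-31 range is assumed, and 
java.time.LocalDate only stands in for the current Hive date logic, which a 
real unit test would call instead (and which may treat very old dates 
differently).

```java
import java.time.LocalDate;

// Hypothetical example class; not part of Hive.
class DateFieldBruteForceCheck {

    // Assumed fast path: derive {year, month, day} from days since 1970-01-01
    // using integer arithmetic on the proleptic Gregorian calendar.
    static int[] fieldsFromEpochDay(long epochDay) {
        long z = epochDay + 719468L;                 // re-base to 0000-03-01
        long era = Math.floorDiv(z, 146097L);        // 400-year cycle index
        long doe = z - era * 146097L;                // day of cycle, 0..146096
        long yoe = (doe - doe / 1460 + doe / 36524 - doe / 146096) / 365; // 0..399
        long doy = doe - (365 * yoe + yoe / 4 - yoe / 100); // day of year, March-based
        long mp = (5 * doy + 2) / 153;               // month index, March = 0
        int day = (int) (doy - (153 * mp + 2) / 5 + 1);
        int month = (int) (mp < 10 ? mp + 3 : mp - 9);
        int year = (int) (yoe + era * 400 + (month <= 2 ? 1 : 0));
        return new int[] {year, month, day};
    }

    // Walk every day in [firstDay, lastDay] and count disagreements with the
    // reference implementation (here LocalDate, as a stand-in).
    static long countMismatches(long firstDay, long lastDay) {
        long mismatches = 0;
        for (long d = firstDay; d <= lastDay; d++) {
            LocalDate expected = LocalDate.ofEpochDay(d);
            int[] actual = fieldsFromEpochDay(d);
            if (actual[0] != expected.getYear()
                    || actual[1] != expected.getMonthValue()
                    || actual[2] != expected.getDayOfMonth()) {
                mismatches++;
            }
        }
        return mismatches;
    }

    public static void main(String[] args) {
        // Assumed DATE range; a few million values, so a full scan is cheap.
        long first = LocalDate.of(1, 1, 1).toEpochDay();
        long last = LocalDate.of(9999, 12, 31).toEpochDay();
        System.out.println("mismatches: " + countMismatches(first, last));
    }
}
```

In a real unit test the loop body would feed the same day values through the 
vectorized expression and through the existing row-mode code, and fail on the 
first disagreement.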

Also, if you want to start on these VectorExpressions that'd be great, but it 
would be best if you can work to make sure that DATE is supported end-to-end 
soon before going too far with the VectorExpressions. Once you get a query on 
date to run end-to-end, then you can go back and do more VectorExpressions for 
operations on date.

E.g. just getting the following query to run vectorized on an ORC file would be 
a good milestone:

select dateField from tablewithdatecolumn;

Feel free to do these few fun VectorExpressions first if you want :-).


> Implement vectorized support for the DATE data type
> ---------------------------------------------------
>
>                 Key: HIVE-5761
>                 URL: https://issues.apache.org/jira/browse/HIVE-5761
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Eric Hanson
>            Assignee: Teddy Choi
>
> Add support to allow queries referencing DATE columns and expression results 
> to run efficiently in vectorized mode. This should re-use the code for the 
> integer/timestamp types to the extent possible and beneficial. Include 
> unit tests and end-to-end tests. Consider re-using or extending existing 
> end-to-end tests for vectorized integer and/or timestamp operations.



--
This message was sent by Atlassian JIRA
(v6.1#6144)
