[ 
https://issues.apache.org/jira/browse/HIVE-4246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13735498#comment-13735498
 ] 

Phabricator commented on HIVE-4246:
-----------------------------------

omalley has commented on the revision "HIVE-4246 [jira] Implement predicate 
pushdown for ORC".

INLINE COMMENTS
  ql/src/java/org/apache/hadoop/hive/ql/exec/TableScanOperator.java:244 I need 
the names, because the SARGs are based on column names instead of ids.
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/RecordReaderImpl.java:1483 They 
really aren't an enum, they are positions in a number stream.
  ql/src/java/org/apache/hadoop/hive/ql/io/sarg/SearchArgument.java:142 +1
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestInStream.java:128 +1

REVISION DETAIL
  https://reviews.facebook.net/D11415

To: JIRA, omalley
Cc: hagleitn

                
> Implement predicate pushdown for ORC
> ------------------------------------
>
>                 Key: HIVE-4246
>                 URL: https://issues.apache.org/jira/browse/HIVE-4246
>             Project: Hive
>          Issue Type: New Feature
>          Components: File Formats
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>         Attachments: HIVE-4246.D11415.1.patch
>
>
> By using the push down predicates from the table scan operator, ORC can skip 
> over 10,000 rows at a time that won't satisfy the predicate. This will help a 
> lot, especially if the file is sorted by the column that is used in the 
> predicate.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to