[ 
https://issues.apache.org/jira/browse/HIVE-13873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607039#comment-15607039
 ] 

ASF GitHub Bot commented on HIVE-13873:
---------------------------------------

Github user asfgit closed the pull request at:

    https://github.com/apache/hive/pull/105


> Support column pruning for struct fields in select statement
> ------------------------------------------------------------
>
>                 Key: HIVE-13873
>                 URL: https://issues.apache.org/jira/browse/HIVE-13873
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Logical Optimizer
>            Reporter: Xuefu Zhang
>            Assignee: Ferdinand Xu
>         Attachments: HIVE-13873.1.patch, HIVE-13873.2.patch, 
> HIVE-13873.3.patch, HIVE-13873.4.patch, HIVE-13873.5.patch, 
> HIVE-13873.6.patch, HIVE-13873.patch, HIVE-13873.wip.patch
>
>
> This is the groundwork for nested column pruning in Hive for the Parquet 
> format. In this patch, we address the case of struct types in select 
> statements. In particular, for queries such as:
> {code}
> select s.a from tbl
> {code}
> where {{tbl}} has schema:
> {code}
> s:struct<a:int, b:boolean, c:array<int>>
> {code}
> only the field {{a}} should be scanned by the Parquet reader, 
> while fields {{b}} and {{c}} can be ignored.
> Future work includes supporting other types of statements, as well as more 
> combinations of types (e.g., selecting fields of array type inside a struct 
> type).
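
To make the intended effect concrete, here is a minimal sketch of struct-field pruning applied to the schema from the description. It is illustrative only and is not the code in the attached patches: the nested-dict schema representation, the dotted-path notation, and the function name prune_schema are all assumptions made for this example.

```python
import copy


def prune_schema(schema, selected_paths):
    """Return a copy of `schema` that keeps only the fields named by dotted paths.

    Hypothetical helper: structs are modeled as dicts, other types as strings.
    """
    pruned = {}
    for path in selected_paths:
        src, dst = schema, pruned
        parts = path.split(".")
        for depth, name in enumerate(parts):
            if not isinstance(src, dict) or name not in src:
                raise KeyError(f"unknown field path: {path}")
            child = src[name]
            if depth == len(parts) - 1:
                # End of the path: keep this field and everything beneath it.
                dst[name] = copy.deepcopy(child)
            else:
                # Intermediate struct: descend, creating an empty struct if needed.
                dst = dst.setdefault(name, {})
                src = child
    return pruned


# Schema from the description: s:struct<a:int, b:boolean, c:array<int>>
schema = {"s": {"a": "int", "b": "boolean", "c": "array<int>"}}

# "select s.a from tbl" references only the path s.a
print(prune_schema(schema, ["s.a"]))  # {'s': {'a': 'int'}}
```

In this sketch the reader would be handed the pruned schema, so {{b}} and {{c}} never appear in what it is asked to scan. Selecting the whole struct (path "s") returns the schema unchanged.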



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
