[ https://issues.apache.org/jira/browse/HIVE-13873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607039#comment-15607039 ]
ASF GitHub Bot commented on HIVE-13873:
---------------------------------------

GitHub user asfgit closed the pull request at:

    https://github.com/apache/hive/pull/105

> Support column pruning for struct fields in select statement
> ------------------------------------------------------------
>
>                 Key: HIVE-13873
>                 URL: https://issues.apache.org/jira/browse/HIVE-13873
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Logical Optimizer
>            Reporter: Xuefu Zhang
>            Assignee: Ferdinand Xu
>         Attachments: HIVE-13873.1.patch, HIVE-13873.2.patch,
> HIVE-13873.3.patch, HIVE-13873.4.patch, HIVE-13873.5.patch,
> HIVE-13873.6.patch, HIVE-13873.patch, HIVE-13873.wip.patch
>
>
> This is the groundwork for nested column pruning in Hive for the Parquet
> format. This patch addresses the case of struct types in select
> statements. In particular, for a query such as:
> {code}
> select s.a from tbl
> {code}
> where {{tbl}} has the schema:
> {code}
> s:struct<a:int, b:boolean, c:array<int>>
> {code}
> only the field {{a}} should be scanned by the Parquet reader, while
> fields {{b}} and {{c}} can be ignored.
> Future work includes supporting other types of statements, as well as more
> combinations of types (e.g., selecting fields of array type inside a struct
> type).
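
To illustrate the pruning described above, here is a minimal sketch of how a set of selected field paths could narrow a nested schema. It is not the code from the patch: the function name `prune_schema` and the nested-dict schema model are hypothetical, chosen only for illustration, and do not correspond to Hive or Parquet APIs.

```python
import copy
from typing import Dict, Iterable, Union

# Hypothetical schema model: a struct is a dict of field name -> type,
# and a leaf type is a plain string such as "int" or "array<int>".
Schema = Dict[str, Union[str, "Schema"]]


def prune_schema(schema: Schema, paths: Iterable[str]) -> Schema:
    """Return a copy of `schema` keeping only the dotted `paths` requested."""
    pruned: Schema = {}
    selected = []
    # Visit shorter paths first so that selecting a whole struct ("s")
    # takes precedence over selecting one of its fields ("s.a").
    unique_paths = list(dict.fromkeys(paths))
    for path in sorted(unique_paths, key=lambda p: p.count(".")):
        parts = tuple(path.split("."))
        if any(parts[:len(prefix)] == prefix for prefix in selected):
            continue  # already covered by a broader selection
        selected.append(parts)
        src, dst = schema, pruned
        for name in parts[:-1]:
            src = src[name]  # raises KeyError for an unknown field
            if not isinstance(src, dict):
                raise KeyError(f"{path}: '{name}' is not a struct")
            dst = dst.setdefault(name, {})
        dst[parts[-1]] = copy.deepcopy(src[parts[-1]])
    return pruned


tbl = {"s": {"a": "int", "b": "boolean", "c": "array<int>"}}
# Corresponds to: select s.a from tbl
needed = prune_schema(tbl, ["s.a"])
```

With the schema from the issue, `needed` keeps only `s.a`, so a reader driven by it would have no reason to touch `b` or `c`. Fields inside arrays are not handled, matching the scope stated in the description.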