[ https://issues.apache.org/jira/browse/HIVE-25628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ádám Szita reassigned HIVE-25628: --------------------------------- > Avoid unnecessary file ops if Iceberg table is LLAP cached > ---------------------------------------------------------- > > Key: HIVE-25628 > URL: https://issues.apache.org/jira/browse/HIVE-25628 > Project: Hive > Issue Type: Improvement > Components: llap > Reporter: Ádám Szita > Assignee: Ádám Szita > Priority: Major > > In case the query execution is vectorized for an Iceberg table, we need to > make an extra file open operation on the ORC file to learn what the file > schema is (to be matched later with the logical schema). > In LLAP configuration the file schema could be retrieved through LLAP cache > as ORC metadata is cached, so we should avoid the file operation when > possible. > Also: LLAP relies on cache keys that are usually triplets of file information > and is constructed by an FS.listStatus call. For iceberg tables we should > rely on such file information provided by Iceberg's metadata to spare this > call too. -- This message was sent by Atlassian Jira (v8.3.4#803005)