[ https://issues.apache.org/jira/browse/HIVE-25453?focusedWorklogId=648493&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-648493 ]
ASF GitHub Bot logged work on HIVE-25453: ----------------------------------------- Author: ASF GitHub Bot Created on: 09/Sep/21 09:48 Start Date: 09/Sep/21 09:48 Worklog Time Spent: 10m Work Description: szlta commented on pull request #2586: URL: https://github.com/apache/hive/pull/2586#issuecomment-915933332 Note: final passing test was not reported to PR so linking it here by hand: http://ci.hive.apache.org/job/hive-precommit/job/PR-2586/14/ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 648493) Time Spent: 3.5h (was: 3h 20m) > Add LLAP IO support for Iceberg ORC tables > ------------------------------------------ > > Key: HIVE-25453 > URL: https://issues.apache.org/jira/browse/HIVE-25453 > Project: Hive > Issue Type: New Feature > Reporter: Ádám Szita > Assignee: Ádám Szita > Priority: Major > Labels: pull-request-available > Fix For: 4.0.0 > > Time Spent: 3.5h > Remaining Estimate: 0h > > Adding support for reading Iceberg ORC tables via LLAP.. > The easy part is swapping out the plain simple VectorizedOrcRecordReader to > LlapRecordReader. > The hard part is maintaining correctness even after a series of schema > changes that are normally allowed to Iceberg/ORC, but were not for simple ORC > or therefore for LLAP. To make it all work, LLAP had to be made to support a > broader schema evolution. > Before this change LLAP made the simple assumption that the reader and file > schemas match all columns, now separate physical and logical read schemas and > corresponding include lists are used instead. Also added > logicalOrderedColumnIds here, which holds indices from the reader schema, but > in file schema order - a necessary tool for mapping the results produced by > LLAP, as LLAP always reads columns in the order as they are written out in > the file. > Also added a new CLI driver class for testing the cached reads from > Iceberg/ORC tables via LLAP. -- This message was sent by Atlassian Jira (v8.3.4#803005)