[jira] [Work logged] (HIVE-25453) Add LLAP IO support for Iceberg ORC tables

ASF GitHub Bot (Jira) Thu, 09 Sep 2021 02:49:04 -0700


     [ 
https://issues.apache.org/jira/browse/HIVE-25453?focusedWorklogId=648493&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-648493
 ]


ASF GitHub Bot logged work on HIVE-25453:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 09/Sep/21 09:48
            Start Date: 09/Sep/21 09:48
    Worklog Time Spent: 10m 
      Work Description: szlta commented on pull request #2586:
URL: https://github.com/apache/hive/pull/2586#issuecomment-915933332


   Note: the final passing test run was not reported to the PR, so linking it here by hand:
   http://ci.hive.apache.org/job/hive-precommit/job/PR-2586/14/
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 648493)
    Time Spent: 3.5h  (was: 3h 20m)

> Add LLAP IO support for Iceberg ORC tables
> ------------------------------------------
>
>                 Key: HIVE-25453
>                 URL: https://issues.apache.org/jira/browse/HIVE-25453
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Ádám Szita
>            Assignee: Ádám Szita
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 4.0.0
>
>          Time Spent: 3.5h
>  Remaining Estimate: 0h
>
> Adding support for reading Iceberg ORC tables via LLAP.
> The easy part is swapping out the plain VectorizedOrcRecordReader for
> LlapRecordReader.
> The hard part is maintaining correctness even after a series of schema
> changes that are normally allowed for Iceberg/ORC, but were not allowed for
> plain ORC and therefore not for LLAP either. To make it all work, LLAP had
> to be made to support broader schema evolution.
> Before this change, LLAP made the simple assumption that the reader and file
> schemas match in all columns. Now separate physical and logical read schemas
> and corresponding include lists are used instead. Also added
> logicalOrderedColumnIds, which holds indices from the reader schema, but
> in file schema order. This is a necessary tool for mapping the results
> produced by LLAP, as LLAP always reads columns in the order in which they
> are written out in the file.
> Also added a new CLI driver class for testing cached reads from
> Iceberg/ORC tables via LLAP.
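
To illustrate the idea behind logicalOrderedColumnIds, here is a minimal
sketch in Python. It is not the Hive implementation: the function names, the
use of plain integer field ids to match columns, and the sample data are all
assumptions made for illustration. It only shows how a list of reader-schema
indices kept in file-schema order can be used to put columns that arrive in
file order back into the slots the reader expects.

```python
def logical_ordered_column_ids(file_field_ids, reader_field_ids):
    """Reader-schema indices of columns present in both schemas, in file order.

    Hypothetical helper: columns are matched by a stable field id.
    """
    reader_index = {fid: i for i, fid in enumerate(reader_field_ids)}
    return [reader_index[fid] for fid in file_field_ids if fid in reader_index]


def place_in_reader_order(file_ordered_columns, ordered_ids, reader_width):
    """Move columns that arrive in file order into their reader-schema slots."""
    row = [None] * reader_width  # columns missing from the file stay None
    for column, reader_idx in zip(file_ordered_columns, ordered_ids):
        row[reader_idx] = column
    return row


# The file was written with field ids 1, 2, 3. After schema changes the
# reader expects 3, 1 and a newly added 4 (field 2 was dropped).
file_ids = [1, 2, 3]
reader_ids = [3, 1, 4]

ordered_ids = logical_ordered_column_ids(file_ids, reader_ids)

# Data comes back in file order for the included columns: field 1, then 3.
result = place_in_reader_order(
    ["col_1_data", "col_3_data"], ordered_ids, len(reader_ids)
)
```

In this sketch the dropped column is simply skipped and the newly added
column is left empty, which is one plausible way to handle columns that exist
in only one of the two schemas.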



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
