[ 
https://issues.apache.org/jira/browse/HIVE-26771?focusedWorklogId=828380&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-828380
 ]

ASF GitHub Bot logged work on HIVE-26771:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 23/Nov/22 13:57
            Start Date: 23/Nov/22 13:57
    Worklog Time Spent: 10m 
      Work Description: kasakrisz opened a new pull request, #3802:
URL: https://github.com/apache/hive/pull/3802

   ### What changes were proposed in this pull request?
   Refactor the way CTAS is executed:
   * Do not create the table in `HiveIcebergSerDe` since it is also created at 
compile time.
   * Add a DDLTask before the `TezTask` to create the Iceberg table.
   * Collect the properties added to the job config from the SerDe object, and 
the location and FileIO from HiveCatalog. The location can be calculated at 
compile time using `SemanticAnalyzer.getCtasOrCMVLocation`.
   * Persist the new table metaobject to a temp file when committing the table 
creation.
   * Read back the table metaobject whenever the TezTask or the MoveTask 
requires it.
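
The last two steps above (persist at commit, read back in later tasks) can be illustrated with a minimal sketch. It is not the code from this pull request: the class `TableMetaHandoff`, its methods, the `java.util.Properties` file format and the sample table name and location are all assumptions made for illustration, and the PR description does not say how the metaobject is actually serialized.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Properties;

/** Hypothetical stand-in for handing a table metaobject from one task to later ones. */
public class TableMetaHandoff {

    /** Writes the table properties to a fresh temp file and returns its path. */
    public static Path persist(Properties tableProps) throws IOException {
        Path file = Files.createTempFile("ctas-table-meta", ".properties");
        try (OutputStream out = Files.newOutputStream(file)) {
            tableProps.store(out, "table metaobject written when table creation is committed");
        }
        return file;
    }

    /** Reads the table properties back, as a later task would. */
    public static Properties readBack(Path file) throws IOException {
        Properties props = new Properties();
        try (InputStream in = Files.newInputStream(file)) {
            props.load(in);
        }
        return props;
    }

    public static void main(String[] args) throws IOException {
        Properties created = new Properties();
        created.setProperty("name", "default.ctas_target");            // made-up table name
        created.setProperty("location", "/tmp/warehouse/ctas_target"); // made-up location

        // Done once, when the table creation is committed.
        Path handoff = persist(created);

        // Done by each later task that needs the table metaobject.
        Properties seenByLaterTask = readBack(handoff);

        System.out.println("round trip equal: " + seenByLaterTask.equals(created));
        Files.deleteIfExists(handoff);
    }
}
```

The point of the sketch is only the hand-off pattern: the object is written once and each consumer reloads it from the file instead of recreating the table.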
   
   
   ### Why are the changes needed?
   See jira.
   
   ### Does this PR introduce _any_ user-facing change?
   No
   
   ### How was this patch tested?
   ```
   mvn test -Dtest.output.overwrite -Dtest=TestIcebergCliDriver 
-Dqfile=ctas_iceberg_partitioned_orc.q -pl itests/qtest-iceberg -Piceberg 
-Pitests -Drat.skip
   ```




Issue Time Tracking
-------------------

            Worklog Id:     (was: 828380)
    Remaining Estimate: 0h
            Time Spent: 10m

> Use DDLTask to create Iceberg table when running ctas statement
> ---------------------------------------------------------------
>
>                 Key: HIVE-26771
>                 URL: https://issues.apache.org/jira/browse/HIVE-26771
>             Project: Hive
>          Issue Type: Improvement
>          Components: Iceberg integration
>            Reporter: Krisztian Kasa
>            Assignee: Krisztian Kasa
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> When an Iceberg table is created via a ctas statement, the table is created 
> in HiveIcebergSerDe and no DDL task is executed.
> Negative effects of this workflow:
> * Default privileges of the new table are not granted.
> * The new Iceberg table can be seen by other transactions at compile time of 
> ctas.
> * Table creation and table properties are not shown in explain ctas output.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
