[ 
https://issues.apache.org/jira/browse/HIVE-25331?focusedWorklogId=622858&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-622858
 ]

ASF GitHub Bot logged work on HIVE-25331:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 15/Jul/21 05:44
            Start Date: 15/Jul/21 05:44
    Worklog Time Spent: 10m 
      Work Description: ujc714 opened a new pull request #2478:
URL: https://github.com/apache/hive/pull/2478


   ### What changes were proposed in this pull request?
   Use a default directory for MANAGEDLOCATION if it's not assigned in CREATE 
DATABASE query.
   
   ### Why are the changes needed?
   HMS doesn't create MANAGEDLOCATION directory if it's NULL. If we run a CTAS 
query immediately after the CREATE DATABASE query and the staging directory is 
not under the MANAGEDLOCATION directory, the CTAS query will fail in MOVE task.
   
   ### Does this PR introduce _any_ user-facing change?
   No.
   
   
   ### How was this patch tested?
   mvn test -Dtest=TestMiniTezCliDriver -Dqfile=create_database.q


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

            Worklog Id:     (was: 622858)
    Remaining Estimate: 0h
            Time Spent: 10m

> Create database query doesn't create MANAGEDLOCATION directory
> --------------------------------------------------------------
>
>                 Key: HIVE-25331
>                 URL: https://issues.apache.org/jira/browse/HIVE-25331
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Robbie Zhang
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> If we don't assign MANAGEDLOCATION in a "create database" query, the 
> MANAGEDLOCATION will be NULL so HMS doesn't create the directory. In this 
> case, a CTAS query immediately after the CREATE DATABASE query might fail in 
> MOVE task due to "destination's parent does not exist". I can use the 
> following script to reproduce this issue:
> {code:java}
> set hive.support.concurrency=true;
> set hive.txn.manager=org.apache.hadoop.hive.ql.lockmgr.DbTxnManager;
> create database testdb location '/tmp/testdb.db';
> create table testdb.test as select 1;
> {code}
> If the staging directory is under the MANAGEDLOCATION directory, the CTAS 
> query is fine as the MANAGEDLOCATION directory is created while creating the 
> staging directory. Since we set LOCATION to a default directory when LOCATION 
> is not assigned in the CREATE DATABASE query, I believe it's worth to set 
> MANAGEDLOCATION to a default directory, too.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to