[ https://issues.apache.org/jira/browse/HIVE-25303?focusedWorklogId=651554&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-651554 ]
ASF GitHub Bot logged work on HIVE-25303: ----------------------------------------- Author: ASF GitHub Bot Created on: 16/Sep/21 09:09 Start Date: 16/Sep/21 09:09 Worklog Time Spent: 10m Work Description: kgyrtkirk commented on a change in pull request #2442: URL: https://github.com/apache/hive/pull/2442#discussion_r709932488 ########## File path: ql/src/java/org/apache/hadoop/hive/ql/parse/TaskCompiler.java ########## @@ -472,6 +474,32 @@ private void setLoadFileLocation( loc = cmv.getLocation(); } Path location = (loc == null) ? getDefaultCtasLocation(pCtx) : new Path(loc); + boolean isExternal = false; + boolean isAcid = false; + if (pCtx.getQueryProperties().isCTAS()) { + isExternal = pCtx.getCreateTable().isExternal(); + isAcid = pCtx.getCreateTable().getTblProps().getOrDefault( + hive_metastoreConstants.TABLE_IS_TRANSACTIONAL, "false").equalsIgnoreCase("true") || + pCtx.getCreateTable().getTblProps().containsKey(hive_metastoreConstants.TABLE_TRANSACTIONAL_PROPERTIES); + if ((HiveConf.getBoolVar(conf, HiveConf.ConfVars.CREATE_TABLE_AS_EXTERNAL) || (isExternal || !isAcid))) { Review comment: that seems to me premature optimization which may just hit back later...it would be simpler to run everything related to location thru the translator and even move the handling of `CREATE_TABLE_AS_EXTERNAL` to there - so that everything is on the same page. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 651554) Time Spent: 3h (was: 2h 50m) > CTAS hive.create.as.external.legacy tries to place data files in managed WH > path > -------------------------------------------------------------------------------- > > Key: HIVE-25303 > URL: https://issues.apache.org/jira/browse/HIVE-25303 > Project: Hive > Issue Type: Bug > Components: HiveServer2, Standalone Metastore > Reporter: Sai Hemanth Gantasala > Assignee: Sai Hemanth Gantasala > Priority: Major > Labels: pull-request-available > Time Spent: 3h > Remaining Estimate: 0h > > Under legacy table creation mode (hive.create.as.external.legacy=true), when > a database has been created in a specific LOCATION, in a session where that > database is Used, tables are created using the following command: > {code:java} > CREATE TABLE <tablename> AS SELECT <select statement>{code} > should inherit the HDFS path from the database's location. Instead, Hive is > trying to write the table data into > /warehouse/tablespace/managed/hive/<database_directory_name>/<table_name> > +Design+: > In the CTAS query, first data is written in the target directory (which > happens in HS2) and then the table is created(This happens in HMS). So here > two decisions are being made i) target directory location ii) how the table > should be created (table type, sd e.t.c). > When HS2 needs a target location that needs to be set, it'll make create > table dry run call to HMS (where table translation happens) and i) and ii) > decisions are made within HMS and returns table object. Then HS2 will use > this location set by HMS for placing the data. -- This message was sent by Atlassian Jira (v8.3.4#803005)