Re: s3a and hive

2016-11-15 Thread Stephen Sprague
just for the record... this config "hive.exec.stagingdir" determines that ".hive_staging" sub-directory. when it defaults to the table path and the table path is in s3 that's where i get the exception: Failed with exception java.io.IOException: rename for src path: s3a://trulia-dwr-cluster-dev/hi
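
(a minimal sketch of the setting mentioned above, not a confirmed fix. "hive.exec.stagingdir" is the property named in the message; the hdfs:// URI below, including the "namenode.example:8020" host, is a hypothetical placeholder and an assumption, and whether staging on a different filesystem avoids the rename exception depends on the hive version in use.)

```sql
-- Print the value currently in effect for this session.
SET hive.exec.stagingdir;

-- Assumption: point staging at a fully qualified HDFS location instead of a
-- ".hive_staging" sub-directory under the s3a table path.
-- The host and path here are hypothetical; substitute your own cluster's values.
SET hive.exec.stagingdir=hdfs://namenode.example:8020/tmp/hive/.hive-staging;
```

the same property can also be passed on the command line with -hiveconf, the way hive.cli.errors.ignore is passed elsewhere in this listing.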

Hive wrapping lines

2016-11-15 Thread Brotanek, Jan
Hello, I am executing a hive command as follows: hive -hiveconf hive.cli.errors.ignore=true -v -S -f tables.sql 2&>tablesCZ.log. Inside tables.sql is simple SQL like the following: select min(INT_CURRENT_DATE), max(INT_CURRENT_DATE) from SYSTGXX.POS_T096132_STG_ORC_TMP;$ select min(INT_CURRENT_DATE), max

Re: s3a and hive

2016-11-15 Thread Stephen Sprague
Thanks Elliot. I think you might be onto something there. :) Making that tiny little switch sure seemed attractive but judging from the Jiras out there the ramifications of that setting are far more involved and nuanced than i thought. awright. you make some convincing arguments there. looks lik

Re: s3a and hive

2016-11-15 Thread Elliot West
My gut feeling is that this is not something you should do (except for fun!). I'm fairly confident that somewhere in Hive, MR, or Tez, you'll hit some code that requires consistent, atomic move/copy/list/overwrite semantics from the warehouse filesystem. This is not something that the vanilla S3AFil

Re: s3a and hive

2016-11-15 Thread Stephen Sprague
no. permissions are good. i believe the case to be that s3a does not have a "move" and/or "rename" semantic but i can't be the first one to encounter this. somebody out there has to have gone down this path way before me surely. searching the cyber i find this: https://issues.apache.org/jira/