Re: Hive transactions not working

2016-11-16 Thread Manoj Murumkar
By default, I have these: hadoop.proxyuser.hive.hosts=* hadoop.proxyuser.hive.groups=* Also, I have impersonation enabled. These settings should have made the compaction to run without issues, but does not. What am I missing? On Wed, Nov 16, 2016 at 6:00 PM, Manoj Murumkar wrote: > Okay, made

Re: Hive transactions not working

2016-11-16 Thread Manoj Murumkar
Okay, made the cleaner work as well :) When the files are owned by a user that is different from the user that runs metastore ("hive"), this doesn't work. Just to test this, I changed ownership of the files for the table to "hive" and ran a major compaction and it does the job. drwxrwxrwt - hiv

Re: Hive transactions not working

2016-11-16 Thread Manoj Murumkar
Further update: I see this: +---+--+++-+---++--+ | dbname | tabname | partname | type |state| workerid | starttime| +---+--+++-+--

Re: Hive transactions not working

2016-11-16 Thread Manoj Murumkar
Quick update: After each compaction, files under base directory (for the buckets) have latest data. However, I am expecting to see all delta files (and directories) gone, as they should be merged in the base directory. Otherwise, we'll start seeing too many small files on HDFS which is a problem.

Hive transactions not working

2016-11-16 Thread Manoj Murumkar
Hi, We are trying to implement transaction feature in hive. I created following table: ++--+ | createtab_stmt | +