Re: How to enable compaction for table with external data?

Alan Gates Tue, 15 Sep 2015 10:00:22 -0700

If you want it to compact automatically, you should not put NO_AUTO_COMPACTION in the table properties.

First question: did you turn on the compactor on your metastore thrift server? To do this you need to set a couple of values in the metastore's hive-site.xml:


hive.compactor.initiator.on=true
hive.compactor.worker.threads=1 # or more
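
For reference, the two settings above could be written in hive-site.xml's usual property format roughly as follows. This is only a sketch: it assumes the standard <configuration> wrapper that file normally has, and the worker thread count of 1 is just the minimum mentioned above, not a tuned value.

```xml
<configuration>
  <!-- Run the compaction initiator on this metastore instance -->
  <property>
    <name>hive.compactor.initiator.on</name>
    <value>true</value>
  </property>
  <!-- Number of compactor worker threads; 1 or more -->
  <property>
    <name>hive.compactor.worker.threads</name>
    <value>1</value>
  </property>
</configuration>
```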

Alan.

Sachin Pasalkar <mailto:sachin_pasal...@symantec.com>
September 14, 2015 at 3:03
Hi,
We are writing ORC files directly from a Storm topology instead of using Hive streaming (due to performance issues with our data). However, we want to compact the data, so we added the "NO_AUTO_COMPACTION"="false" option to the table we created to read the data (1.6 GB scattered across multiple small files) in ORC format. Does "NO_AUTO_COMPACTION" mean it will not compact data while Hive streaming is used? If not, why did it not compact our data into one file?
We also tried calling compaction manually from Java code using org.apache.hadoop.hive.metastore.txn.TxnHandler's compact API, which shows that compaction has started when we run the SHOW COMPACTIONS command. But it still does not work. I don't want to execute the manual commands from the command line.
Is there any way?

PS: We are writing all files in one directory only.

Thanks,
Sachin
