Re: Compaction in hive

2016-12-07 Thread Nishant Aggarwal
Hi Allan, Good Morning Thanks for your reply. We have lots of external tables with parquet and gzip format. Data will be pushed to those tables on regular interval with volume close to 10-20GB/per day. Our concern is that this process will generate lots of small files in the tables. We are searc

Re: Compaction in hive

2016-12-06 Thread Alan Gates
What exactly do you mean by compaction? Hive has a compactor that runs over ACID tables to handle the delta files[1], but I’m guessing you don’t mean that. Are you wanting to concatenate files in existing tables? The usual way to do that is alter table concatenate[2]. Or do you mean somethin