Re: How to configure Hive to use CombineFileInputFormat in case of too many small files

2011-04-08 Thread Michael Jiang
the file sizes. I haven't tried it but i guess its > there in this page. > > -- > *From:* Michael Jiang > *To:* user@hive.apache.org > *Cc:* V.Senthil Kumar > *Sent:* Fri, April 8, 2011 11:56:21 AM > *Subject:* Re: How to configure Hive to us

Re: How to configure Hive to use CombineFileInputFormat in case of too many small files

2011-04-08 Thread Michael Jiang
Thanks Kumar. Are there other settings to fine tune how small files are merged into a bigger one that a mapper takes? Basically I want to match the size of a merged file to the block size. On Fri, Apr 8, 2011 at 11:43 AM, V.Senthil Kumar wrote: > You can add these lines in hive-site.xml. It cr

Re: How to configure Hive to use CombineFileInputFormat in case of too many small files

2011-04-08 Thread V.Senthil Kumar
You can add these lines in hive-site.xml. It creates only one file at the end. Hope it helps. hive.merge.mapredfiles true Merge small files at the end of a map-reduce job hive.input.format org.apache.hadoop.hive.ql.io.CombineHiveInputFormat The default input format, if it is not s