Re: Hive File Sizes, Merging, and Splits

2012-09-26 Thread Ruslan Al-Fakikh
tasks is BETTER. > > > > From: John Omernik [j...@omernik.com] > Sent: Tuesday, September 25, 2012 7:11 PM > To: user@hive.apache.org > Subject: Re: Hive File Sizes, Merging, and Splits > > Isn't there an overhead associated with each map task? Based on that, m

RE: Hive File Sizes, Merging, and Splits

2012-09-25 Thread Connell, Chuck
user@hive.apache.org> Subject: Hive File Sizes, Merging, and Splits I am really struggling trying to make hears or tails out of how to optimize the data in my tables for best query times. I have a partition that is compressed (Gzip) RCFile data in two files total 421877 263715 -rwxr-x

Re: Hive File Sizes, Merging, and Splits

2012-09-25 Thread John Omernik
35 PM, Connell, Chuck wrote: > Why do you think the current generated code is inefficient? > > ** ** > > ** ** > > ** ** > > *From:* John Omernik [mailto:j...@omernik.com] > *Sent:* Tuesday, September 25, 2012 2:57 PM > *To:* user@hive.apache.org > *Subje

RE: Hive File Sizes, Merging, and Splits

2012-09-25 Thread Connell, Chuck
Why do you think the current generated code is inefficient? From: John Omernik [mailto:j...@omernik.com] Sent: Tuesday, September 25, 2012 2:57 PM To: user@hive.apache.org Subject: Hive File Sizes, Merging, and Splits I am really struggling trying to make hears or tails out of how to optimize

Hive File Sizes, Merging, and Splits

2012-09-25 Thread John Omernik
I am really struggling trying to make hears or tails out of how to optimize the data in my tables for best query times. I have a partition that is compressed (Gzip) RCFile data in two files total 421877 263715 -rwxr-xr-x 1 darkness darkness 270044140 2012-09-25 13:32 00_0 158162 -rwxr-xr-x 1