does anyone have any idea? this seems like very strange behavior to me, and it
blows up the job.
On Fri, Jul 22, 2011 at 5:51 PM, Koert Kuipers wrote:
> hello,
> we have 2 tables x and y. table x is 11GB on disk and has 23M rows. table y
> is 3GB on disk and has 28M rows. Both tables are stored as LZO com
hello,
we have 2 tables x and y. table x is 11GB on disk and has 23M rows. table y
is 3GB on disk and has 28M rows. Both tables are stored as LZO compressed
sequencefiles without bucketing.
a normal join of x and y gets executed as a map-reduce-join in hive and works
very well. an outer join also g