Re: Hive map join - process a little larger tables withmoderatenumber of rows

2011-04-01 Thread bejoy_ks
Thanks for your reply Viral. However in later versions of hive you don't have to tell hive anything (which is the smaller table) . During runtime hive itself identifies the smaller table and do the local map task on the same irrespective of whether it comes on left or right side of the join. Th

Re: Hive map join - process a little larger tables withmoderatenumber of rows

2011-04-01 Thread bejoy_ks
Thanks Yongqiang . I worked for me and I was able to evaluate the performance. It proved to be expensive :) Regards Bejoy K S -Original Message- From: yongqiang he Date: Thu, 31 Mar 2011 22:27:26 To: ; Reply-To: user@hive.apache.org Subject: Re: Hive map join - process a little larger

Re: Hive map join - process a little larger tables with moderatenumber of rows

2011-04-01 Thread Viral Bajaria
Bejoy, We still use older version of Hive (0.5). In that version the join order used to matter. You needed to keep the largest table as the rightmost in your JOIN sequence to make sure that it is streamed and thus avoid the OOM exceptions which are caused by mappers which load the entire table in