Thanks for your reply Viral. However in later versions of hive you don't have
to tell hive anything (which is the smaller table) . During runtime hive itself
identifies the smaller table and do the local map task on the same irrespective
of whether it comes on left or right side of the join. Th
Thanks Yongqiang . I worked for me and I was able to evaluate the performance.
It proved to be expensive :)
Regards
Bejoy K S
-Original Message-
From: yongqiang he
Date: Thu, 31 Mar 2011 22:27:26
To: ;
Reply-To: user@hive.apache.org
Subject: Re: Hive map join - process a little larger
Bejoy,
We still use older version of Hive (0.5). In that version the join order
used to matter. You needed to keep the largest table as the rightmost in
your JOIN sequence to make sure that it is streamed and thus avoid the OOM
exceptions which are caused by mappers which load the entire table in