Harsha, that document is from 2010.  What version of Hive are you using?

Here's some up-to-date information in the Hive wiki:  Join Optimimzation
<https://cwiki.apache.org/confluence/display/Hive/LanguageManual+JoinOptimization>
.

-- Lefty

On Thu, Apr 16, 2015 at 2:38 AM, Harsha HN <99harsha.h....@gmail.com> wrote:

> Hi All,
>
> I went through below mentioned Facebook engineering page,
> https://www.facebook.com/notes/facebook-engineering/join
> -optimization-in-apache-hive/470667928919
>
> I set following for auto conversion of joins,
> set hive.auto.convert.join=true;
> set hive.mapjoin.smalltable.filesize=1000000000;    (1GB)
>
> I observed some queries performed 2X faster in MAP JOIN as opposed to
> Common join
> and also instances where MAP JOIN is 3X slower than Common Join.
>
> Any thoughts on what might be slowing down MAP JOIN in some cases ?
>
> I have 40 Node cluster, so I have huge RAM available.
>
> Thanks,
> Harsha
>

Reply via email to