The runtime went from about 60 minutes to about 3 minutes. Before, Hive
was traversing the whole table multiple times, which is obviously
inefficient!
> Date: Tue, 7 Jul 2015 15:55:19 -0700
> Subject: Re: Limiting outer join
> From: gop...@apache.org
> To: user@hive.apache.org
>
>
> > Never mind, I got it working with a UDF. I just pass the file location
> > to my evaluate function. Thanks! :)
>
> Nice. Would be very interested in looking at performance of such a UDF, if
> you have numbers before/after.
>
> I suspect it will be an order of magnitude or more faster than the
> BETWEEN/JOIN clauses.
>
> Cheers,
> Gopal