Is this table bucketed? If so, please set the number of reducers (set
mapreduce.job.reduces=bucket_size) to match the table's number of buckets.
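
For illustration, a rough sketch of what that could look like in a Hive session. It assumes the bucketed table is U and that it was created with 32 buckets; both are placeholders, since the thread does not say which table is bucketed or how many buckets it has.

```sql
-- Check whether the table is bucketed; the output includes a "Num Buckets" line.
-- (Assumption: U is the table in question.)
DESCRIBE FORMATTED U;

-- Set the reducer count to that number.
-- 32 is a hypothetical value; use whatever "Num Buckets" reports.
SET mapreduce.job.reduces=32;
```

If "Num Buckets" shows -1, the table is not bucketed and this setting would not apply in the way described above.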
On Thu, Dec 17, 2015 at 1:25 AM, Awhan Patnaik wrote:
3-node cluster with 15 GB of RAM per node. There are two tables: L has
approximately 1 million rows and U has 100 million. They both have latitude
and longitude columns. I want to find the count of rows in U that are within
a 10 mile radius of each row in L.
I have indexed the latitude and longitude colu