ut how to do the map side join in Spark.
>>>
>>> In 1.5.x, there is a broadcast function in the Dataframe, and it caused
>>> OOM for me simple test case, even one side of join is very small.
>>>
>>> I am still trying to find out the root cause yet.
&g
t the root cause yet.
>>
>> Yong
>>
>> ------
>> Date: Wed, 2 Mar 2016 15:38:29 +0530
>> Subject: Re: Mapper side join with DataFrames API
>> From: dgk...@gmail.com
>> To: mich...@databricks.com
>> CC: u...@spark.apache
Hello All,
Just to add to this question a bit more context
I have a join as stated above and I see in my executor logs the below :
16/02/29 17:02:35 INFO TaskSetManager: Finished task 198.0 in stage 7.0
(TID 1114) in 20354 ms on localhost (196/200)
16/02/29 17:02:35 INFO ShuffleBlockFetcher