Re: Broadcast Join and Inner Join giving different result on same DataFrame

2017-01-03 Thread ayan guha
I think productBroadcastDF is broadcast variable in your case, not the DF itself. Try the join with productBroadcastDF.value On Wed, Jan 4, 2017 at 1:04 AM, Patrick wrote: > Hi, > > An Update on above question: In Local[*] mode code is working fine. The > Broadcast size is 200MB, but on Yarn it

Re: Broadcast Join and Inner Join giving different result on same DataFrame

2017-01-03 Thread Patrick
Hi, An Update on above question: In Local[*] mode code is working fine. The Broadcast size is 200MB, but on Yarn it the broadcast join is giving empty result.But in Sql Query in UI, it does show BroadcastHint. Thanks On Fri, Dec 30, 2016 at 9:15 PM, titli batali wrote: > Hi, > > I have two da

Broadcast Join and Inner Join giving different result on same DataFrame

2016-12-30 Thread titli batali
Hi, I have two dataframes which has common column Product_Id on which i have to perform a join operation. val transactionDF = readCSVToDataFrame(sqlCtx: SQLContext, pathToReadTransactions: String, transactionSchema: StructType) val productDF = readCSVToDataFrame(sqlCtx: SQLContext, pathTo