Re: Broadcast hash join implementation in Spark

2016-07-09 Thread Lalitha MV
Hi Jagat, This property only defines the threshold of small table's size for broadcast hash join to be supported. Lalitha On Fri, Jul 8, 2016 at 11:47 PM, Jagat Singh wrote: > Hi, > > Please see the property spark.sql.autoBroadcastJoinThreshold here > > > http://spark.apache.org/docs/latest/s

Re: Broadcast hash join implementation in Spark

2016-07-08 Thread Jagat Singh
Hi, Please see the property spark.sql.autoBroadcastJoinThreshold here http://spark.apache.org/docs/latest/sql-programming-guide.html#other-configuration-options Thanks, Jagat Singh On Sat, Jul 9, 2016 at 9:50 AM, Lalitha MV wrote: > Hi, > > 1. What implementation is used for the hash join --

Broadcast hash join implementation in Spark

2016-07-08 Thread Lalitha MV
Hi, 1. What implementation is used for the hash join -- is it classic hash join or Hybrid grace hash join? 2. If the hash table does not fit in memory, does it spill or does it fail? Are there parameters to control this (for example to set the percentage of hash table which can spill etc.) 3. Is t