Re: Broadcast hash join implementation in Spark

2016-07-09 Thread Lalitha MV
.apache.org/docs/latest/sql-programming-guide.html#other-configuration-options > > Thanks, > > Jagat Singh > > On Sat, Jul 9, 2016 at 9:50 AM, Lalitha MV wrote: > >> Hi, >> >> 1. What implementation is used for the hash join -- is it classic hash >> joi

Broadcast hash join implementation in Spark

2016-07-08 Thread Lalitha MV
Hi, 1. What implementation is used for the hash join -- is it classic hash join or Hybrid grace hash join? 2. If the hash table does not fit in memory, does it spill or does it fail? Are there parameters to control this (for example to set the percentage of hash table which can spill etc.) 3. Is t

Re: Enforcing shuffle hash join

2016-07-04 Thread Lalitha MV
precedence: > * - BroadcastNestedLoopJoin: if one side of the join could be broadcasted > * - CartesianProduct: for Inner join > * - BroadcastNestedLoopJoin > */ > > > > On Jul 5, 2016, at 13:28, Lalitha MV wrote: > > It picks sort merge join, when spark.sql

Re: Enforcing shuffle hash join

2016-07-04 Thread Lalitha MV
maropu > > On Tue, Jul 5, 2016 at 4:23 AM, Lalitha MV wrote: > >> Hi maropu, >> >> Thanks for your reply. >> >> Would it be possible to write a rule for this, to make it always pick >> shuffle hash join, over other join implementations(i.e. sort merge a

Re: Enforcing shuffle hash join

2016-07-04 Thread Lalitha MV
On Sat, Jul 2, 2016 at 12:58 AM, Takeshi Yamamuro wrote: > Hi, > > No, spark has no hint for the hash join. > > // maropu > > On Fri, Jul 1, 2016 at 4:56 PM, Lalitha MV wrote: > >> Hi, >> >> In order to force broadcast hash join, we can set >> the

Enforcing shuffle hash join

2016-07-01 Thread Lalitha MV
Hi, In order to force broadcast hash join, we can set the spark.sql.autoBroadcastJoinThreshold config. Is there a way to enforce shuffle hash join in spark sql? Thanks, Lalitha