[Spark SQL]: Why does the OptimizeSkewedJoin rule not optimize FullOuterJoin?

2024-07-15 Thread 王仲轩(万章)
Hi, I am a beginner in Spark and am currently learning the Spark source code. I have a question about the AQE rule OptimizeSkewedJoin. I have a SQL query that uses a sort-merge join (SMJ) FullOuterJoin, and there is read skew on the left side (the case is described below). Case: remote bytes read total (min, med, max)

Re: [Issue] Spark SQL - broadcast failure

2024-07-15 Thread Sudharshan V
On Mon, 8 Jul 2024, 7:53 pm, Sudharshan V wrote:
> Hi all,
>
> We have been facing a weird issue lately.
> In our production code base, we have an explicit broadcast for a small table.
> It is just a lookup table that is around 1 GB in size in S3, with only a few million records and 5 columns.