Yes, indeed! A few talks in the developer and deep dives address the data skews issue and how to address them.
I shall let the group know when the talk sessions are available. Cheers Jules Sent from my iPhone Pardon the dumb thumb typos :) > On Apr 29, 2019, at 2:13 PM, Michael Mansour <[email protected]> > wrote: > > There were recently some fantastic talks about this at the SparkSummit > conference in San Francisco. I suggest you check out the SparkSummit YouTube > channel after May 9th for a deep dive into this topic. > > From: rajat kumar <[email protected]> > Date: Monday, April 29, 2019 at 9:34 AM > To: "[email protected]" <[email protected]> > Subject: [EXT] handling skewness issues > > Hi All, > > How to overcome skewness issues in spark ? > > I read that we can add some randomness to key column before join and remove > that random part after join. > > is there any better way ? Above method seems to be a workaround. > > thanks > rajat
