Have a look at this presentation. http://www.slideshare.net/colorant/spark-shuffle-introduction . Can be of help to you.
On Sat, Aug 15, 2015 at 1:42 PM, Muhammad Haseeb Javed < 11besemja...@seecs.edu.pk> wrote: > What are the major differences between how Sort based and Hash based > shuffle operate and what is it that cause Sort Shuffle to perform better > than Hash? > Any talks that discuss both shuffles in detail, how they are implemented > and the performance gains ? >