On Sat, Apr 16, 2016 at 1:05 PM, Matthias J. Sax <mj...@apache.org> wrote: > (with the need to sort the data, because both > datasets will be sorted on A already). Thus, the overhead of sorting in > the group might pay of in the join.
I think you meant to write withOUT the need to the sort the data, right?