Split planning works the way it currently does to prioritize newer files
and avoid reordering them. The idea is that newer files are the most likely
to be read, so engines like Presto that continue to plan splits as the
first splits run will return results faster.

I like the idea of returning the biggest bin first, that could be a winner
since it still mostly uses files in order (like packing) and helps solve
the size problem. Let's do it!

Reply via email to