berkaysynnada commented on PR #14160:
URL: https://github.com/apache/datafusion/pull/14160#issuecomment-2601723615

   `BatchCoalescer` is not used in joins yet, because `CoalesceBatchesExec` is 
placed after joins that have a filter, in case their output batches have a 
lower row count than the target batch size. So why can't we follow the same 
pattern in SMJ? If collecting batches inside the join itself is more 
performant, shouldn't we refactor the other joins as well?
   
   On the other hand, `BatchSplitter` is already used in the other joins, and 
SMJ could (should) have it too, as there is no other way to split output 
batches according to the target batch size.
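
   To make the distinction concrete, here is a rough sketch of the two 
operations in plain Rust. It is only an illustration: a `Vec<i32>` stands in 
for a record batch, and `split_batch` / `coalesce_batches` are hypothetical 
helpers, not the actual `BatchSplitter` or `BatchCoalescer` APIs. The 
coalescing half is also simplified to emit exactly `target` rows per batch, 
which may not match how `CoalesceBatchesExec` decides when to flush.

```rust
/// Split one oversized "batch" into chunks of at most `target` rows.
/// (Hypothetical stand-in for what a splitter does inside a join.)
fn split_batch(batch: &[i32], target: usize) -> Vec<Vec<i32>> {
    assert!(target > 0, "target batch size must be positive");
    batch.chunks(target).map(|chunk| chunk.to_vec()).collect()
}

/// Merge undersized "batches" into batches of `target` rows.
/// Only the final batch may hold fewer than `target` rows.
/// (Hypothetical stand-in for what a coalescer does after a join.)
fn coalesce_batches(batches: &[Vec<i32>], target: usize) -> Vec<Vec<i32>> {
    assert!(target > 0, "target batch size must be positive");
    let mut out = Vec::new();
    let mut buffer: Vec<i32> = Vec::with_capacity(target);
    for batch in batches {
        for &row in batch {
            buffer.push(row);
            if buffer.len() == target {
                // Buffer is full: emit it and start a fresh one.
                out.push(std::mem::replace(&mut buffer, Vec::with_capacity(target)));
            }
        }
    }
    // Flush whatever is left at end of input.
    if !buffer.is_empty() {
        out.push(buffer);
    }
    out
}
```

   Coalescing can be done by a separate operator downstream, since it only 
needs the small batches the join has already produced. Splitting has to 
happen where the oversized batch is created, which is the reason for 
wanting it inside SMJ.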


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

