comphead opened a new issue, #14238:
URL: https://github.com/apache/datafusion/issues/14238

                 `BatchCoalescer` is not used in joins yet, since 
CoalesceBatchesExec appears after the joins having filter, in case of the 
output batches might have a  lower row count than target batch size. So, why 
cannot we follow the same pattern in SMJ? If collecting batches in the join 
itself is more performant, then we should also refactor the other joins as well?
   
   On the other hand, `BatchSplitter` is used in other joins, and SMJ could 
(should) have it too, as there is no other way of splitting the batches 
according to target batch size.
   
   _Originally posted by @berkaysynnada in 
https://github.com/apache/datafusion/issues/14160#issuecomment-2601723615_
               


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to