Dandandan commented on issue #6822:
URL: https://github.com/apache/datafusion/issues/6822#issuecomment-2747639530
   I wrote some ideas of supporting selection vectors inside hash join and 
aggregate (I believe we didn't have those issues?)
   
   This seems to be likely to give more substantial gains than trying to 
optimize only the partitioning code only as, even optimized, we still need to 
copy the inputs (and run CoalesceBatchesExec).
   
   https://github.com/apache/datafusion/issues/15382
   https://github.com/apache/datafusion/issues/15383
   
   I think (at least for join, aggregate I am less certain) it might not be too 
hard to implement.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to