alamb opened a new pull request, #16249:
URL: https://github.com/apache/datafusion/pull/16249

   ## Which issue does this PR close?
   
   
   - Related to https://github.com/apache/arrow-rs/issues/6692
   - Related to #3463 
   
   ## Rationale for this change
   
   I am trying to move the coalesce operation upstream into arrow-rs so that it 
can be reused in parquet filter pushdown and more highly optimized. See :
   - https://github.com/apache/arrow-rs/pull/7597
   
   The proposed upstream coalescer API is slightly different (guaranteed batch 
size, and doesn't have limit) so we must adapt the DataFusion code to handle 
this 
   
   ## What changes are included in this PR?
   
   This PR refactors the `BatchCoalescer` in DataFusion to use the proposed 
upstream API to show that it
   1. Can be used (api is complete enough)
   2. Is not any slower 
   
   
   
   ## Are these changes tested?
   
   By CI tests
   
   ## Are there any user-facing changes?
   
   No


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to