haohuaijin opened a new issue, #16724:
URL: https://github.com/apache/datafusion/issues/16724

   ### Is your feature request related to a problem or challenge?
   
   DataFusion employs a two-phase aggregation process. In the first phase, it 
produces intermediate results, and in the second phase, it merges these 
intermediate results to generate the final output.
   
   I want to merge the intermediate results and output the merged intermediate 
results directly, without producing the final output. However, this is hard to 
do outside of DataFusion because some of the functions and structs involved are 
private. As a workaround, we have forked DataFusion and made those items public.
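
To make the request concrete, here is a minimal sketch of the two phases for an `AVG`-style aggregate, using only the Rust standard library. It does not use DataFusion's API: `AvgState`, `partial_aggregate`, `merge_partials`, and `finalize` are hypothetical names chosen for illustration. The step I need is `merge_partials`, whose output is still intermediate state rather than the final value.

```rust
use std::collections::HashMap;

/// Hypothetical intermediate state for AVG: a running sum and a row count.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct AvgState {
    pub sum: f64,
    pub count: u64,
}

/// Phase 1: aggregate the raw (group key, value) rows of one partition
/// into per-group intermediate state.
pub fn partial_aggregate(rows: &[(&str, f64)]) -> HashMap<String, AvgState> {
    let mut groups = HashMap::new();
    for &(key, value) in rows {
        let state = groups
            .entry(key.to_string())
            .or_insert(AvgState { sum: 0.0, count: 0 });
        state.sum += value;
        state.count += 1;
    }
    groups
}

/// Merge intermediate states from several partitions.
/// The result is still intermediate state, so it can be merged again later.
pub fn merge_partials(partials: Vec<HashMap<String, AvgState>>) -> HashMap<String, AvgState> {
    let mut merged: HashMap<String, AvgState> = HashMap::new();
    for partial in partials {
        for (key, state) in partial {
            let acc = merged
                .entry(key)
                .or_insert(AvgState { sum: 0.0, count: 0 });
            acc.sum += state.sum;
            acc.count += state.count;
        }
    }
    merged
}

/// Phase 2 (final step): turn merged intermediate state into the final output.
pub fn finalize(merged: &HashMap<String, AvgState>) -> HashMap<String, f64> {
    merged
        .iter()
        .map(|(key, state)| (key.clone(), state.sum / state.count as f64))
        .collect()
}
```

In this sketch, calling `merge_partials` on the outputs of several `partial_aggregate` calls and stopping there corresponds to "merge the intermediate results and output them directly"; `finalize` is the extra step I want to be able to skip.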
   
   ### Describe the solution you'd like
   
   Make some aggregate-related APIs public; see the PR 
https://github.com/haohuaijin/arrow-datafusion/pull/1. These APIs are used in 
`GroupedHashAggregateStream`, and we maintain a modified version of 
`GroupedHashAggregateStream` that merges intermediate results and outputs 
intermediate results. If it is OK to make them public, I can submit a PR to DataFusion.
   
   ### Describe alternatives you've considered
   
   If these APIs cannot be made public, I will need to keep the current approach 
and port this part of the code every time I upgrade DataFusion. If I use a library 
built on DataFusion, I also need to fork and modify that library.
   
   ### Additional context
   
   It would be great if these functions and structs could be made public.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

