haohuaijin opened a new issue, #16724: URL: https://github.com/apache/datafusion/issues/16724
### Is your feature request related to a problem or challenge? DataFusion employs a two-phase aggregation process. In the first phase, it produces intermediate results, and in the second phase, it merges these intermediate results to generate the final output. I want to merge the intermediate results and output them directly. However, this is challenging outside of DataFusion due to some private functions and structures. To address this, we have forked DataFusion and added functionality to make these processes public. ### Describe the solution you'd like public some aggregate related api, see the pr https://github.com/haohuaijin/arrow-datafusion/pull/1, those api is used in `GroupedHashAggregateStream` that we have a modify version of `GroupedHashAggregateStream` to merge intermediate results and output intermediate results. if it ok to public, i can submit pr to datafusion. ### Describe alternatives you've considered If it cannot be made public, I need to keep the current method and migrate this part of the code every time I upgrade datafusion. If I use a library based on datafuson, I also need to fork and modify the version. ### Additional context It would be great if these function and struct can be made public. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org