alamb opened a new pull request, #15661: URL: https://github.com/apache/datafusion/pull/15661
Draft, as this builds on https://github.com/apache/datafusion/pull/15659

## Which issue does this PR close?

- Follow on to https://github.com/apache/datafusion/pull/15659
- Related to https://github.com/apache/datafusion/pull/15503
- Part of https://github.com/apache/datafusion/issues/8229
- Part of https://github.com/apache/datafusion/issues/15495

## Rationale for this change

As @xudong963 works on getting the statistics code into better shape, we have to move the code that combines multiple `Statistics` objects (`compute_summary_statistics`) and make it public so it can be reused (see https://github.com/apache/datafusion/pull/15503).

While we are doing this, let's consolidate the functionality into `Statistics`, where it is:

1. Easier to discover (so we don't end up with multiple copies)
2. Easier to test (e.g. for statistics with different schemas)
3. Easier to document

## What changes are included in this PR?

1. Add `Statistics::merge_iter`
2. Add `Statistics::merge`
3. Remove `compute_summary_statistics`

## Are these changes tested?

Yes, by existing tests and by a newly added doc test and unit test.

## Are there any user-facing changes?
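
To illustrate the idea of combining several statistics objects into one summary, here is a minimal, self-contained sketch. It does not use DataFusion and is not the code in this PR: `Precision`, `Stats`, `Stats::merge`, and `Stats::merge_iter` below are simplified, hypothetical stand-ins, and per-column statistics are left out entirely. The only assumption it encodes is that a merged count stays exact when both inputs are exact, becomes an estimate when either input is an estimate, and is unknown when either input is unknown.

```rust
/// Hypothetical stand-in for a count that is exact, estimated, or unknown.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Precision {
    Exact(usize),
    Inexact(usize),
    Absent,
}

impl Precision {
    /// Add two counts. The result is exact only if both inputs are exact,
    /// and unknown if either input is unknown.
    fn add(self, other: Precision) -> Precision {
        match (self, other) {
            (Precision::Exact(a), Precision::Exact(b)) => Precision::Exact(a + b),
            (Precision::Exact(a), Precision::Inexact(b))
            | (Precision::Inexact(a), Precision::Exact(b))
            | (Precision::Inexact(a), Precision::Inexact(b)) => Precision::Inexact(a + b),
            _ => Precision::Absent,
        }
    }
}

/// Hypothetical, heavily simplified statistics (no per-column data).
#[derive(Debug, Clone, PartialEq)]
struct Stats {
    num_rows: Precision,
    total_byte_size: Precision,
}

impl Stats {
    /// Statistics about which nothing is known.
    fn unknown() -> Stats {
        Stats {
            num_rows: Precision::Absent,
            total_byte_size: Precision::Absent,
        }
    }

    /// Combine `self` with `other` by summing row counts and byte sizes.
    fn merge(self, other: &Stats) -> Stats {
        Stats {
            num_rows: self.num_rows.add(other.num_rows),
            total_byte_size: self.total_byte_size.add(other.total_byte_size),
        }
    }

    /// Fold any number of statistics into one summary.
    /// An empty input yields `Stats::unknown()` (a choice made for this sketch).
    fn merge_iter<'a>(items: impl IntoIterator<Item = &'a Stats>) -> Stats {
        let mut iter = items.into_iter();
        match iter.next() {
            None => Stats::unknown(),
            Some(first) => iter.fold(first.clone(), |acc, s| acc.merge(s)),
        }
    }
}
```

In this sketch, merging a partition with exactly 10 rows and one with an estimated 5 rows gives an estimated 15 rows. Keeping the pairwise `merge` and the iterator-based `merge_iter` on the same type mirrors the discoverability argument above: there is one obvious place to look for the combining logic.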