xudong963 commented on code in PR #15296: URL: https://github.com/apache/datafusion/pull/15296#discussion_r2007748334
########## datafusion/expr-common/src/statistics.rs: ########## @@ -203,6 +203,138 @@ impl Distribution { }; Ok(dt) } + + /// Merges two distributions into a single distribution that represents their combined statistics. + /// This creates a more general distribution that approximates the mixture of the input distributions. + /// + /// # Important Notes + /// + /// - The resulting mean, median, and variance are approximations of the mixture + /// distribution parameters. They are calculated using weighted averages based on + /// the input distributions. Users should not make definitive assumptions based on these values. Review Comment: I agree, this will result in larger uncertainties. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org