xudong963 commented on code in PR #15296:
URL: https://github.com/apache/datafusion/pull/15296#discussion_r2007748334


##########
datafusion/expr-common/src/statistics.rs:
##########
@@ -203,6 +203,138 @@ impl Distribution {
         };
         Ok(dt)
     }
+
+    /// Merges two distributions into a single distribution that represents 
their combined statistics.
+    /// This creates a more general distribution that approximates the mixture 
of the input distributions.
+    ///
+    /// # Important Notes
+    ///
+    /// - The resulting mean, median, and variance are approximations of the 
mixture
+    ///   distribution parameters. They are calculated using weighted averages 
based on
+    ///   the input distributions. Users should not make definitive 
assumptions based on these values.

Review Comment:
   I agree, this will result in larger uncertainties.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to