The EmpiricalDistribution class in the random package is designed to
support large samples.  It does not store all of the data points in
memory, but instead bins the data and uses smoothing kernels within
the bins.  I have recently had the need for a discrete empirical
distribution - i.e., an implementation that stores the full dataset
in memory and creates the empirical distribution exactly from it. 
If there are no objections, I would like to open a JIRA and commit
o.a.c.m.distribution.DiscreteEmpiricalDistribution implementing
this.  I will document the approach and design decisions in the JIRA
if others are OK with this addition.
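
To make the idea concrete, here is a minimal sketch of what an exact,
in-memory empirical distribution could look like.  It is an
illustration only: the class name DiscreteEmpiricalSketch and all of
its method names are hypothetical, it does not depend on Commons Math,
and it assumes a non-empty sample with no NaN values.  The actual
design would be worked out in the JIRA.

```java
import java.util.Arrays;
import java.util.Random;

/** Hypothetical sketch; not the proposed Commons Math class. */
final class DiscreteEmpiricalSketch {
    /** Full dataset, sorted ascending and kept in memory (no binning). */
    private final double[] sorted;

    DiscreteEmpiricalSketch(double[] data) {
        if (data.length == 0) {
            throw new IllegalArgumentException("sample must not be empty");
        }
        sorted = data.clone();
        Arrays.sort(sorted);
    }

    /** Index of the first element strictly greater than x. */
    private int upperBound(double x) {
        int lo = 0;
        int hi = sorted.length;
        while (lo < hi) {
            int mid = (lo + hi) >>> 1;
            if (sorted[mid] <= x) {
                lo = mid + 1;
            } else {
                hi = mid;
            }
        }
        return lo;
    }

    /** Index of the first element greater than or equal to x. */
    private int lowerBound(double x) {
        int lo = 0;
        int hi = sorted.length;
        while (lo < hi) {
            int mid = (lo + hi) >>> 1;
            if (sorted[mid] < x) {
                lo = mid + 1;
            } else {
                hi = mid;
            }
        }
        return lo;
    }

    /** P(X <= x): fraction of observations less than or equal to x. */
    double cumulativeProbability(double x) {
        return (double) upperBound(x) / sorted.length;
    }

    /** P(X = x): fraction of observations exactly equal to x. */
    double probability(double x) {
        return (double) (upperBound(x) - lowerBound(x)) / sorted.length;
    }

    /** Draws one observation uniformly at random from the stored data. */
    double sample(Random rng) {
        return sorted[rng.nextInt(sorted.length)];
    }
}
```

Sorting once at construction keeps each probability query at
O(log n), at the cost of holding all n values in memory, which is the
trade-off against the binned EmpiricalDistribution.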

Phil
