Thanks for the replies, Jihoon and Gian. I have created a proposal - https://github.com/apache/incubator-druid/issues/7303.
On Tue, Mar 19, 2019 at 5:29 PM Gian Merlino <g...@apache.org> wrote: > (The template is on > https://github.com/apache/incubator-druid/issues/new/choose) > > It sounds cool to me too! > > On Tue, Mar 19, 2019 at 5:19 PM Jihoon Son <ghoon...@gmail.com> wrote: > > > Sounds great! > > Would you mind writing a proposal about this? > > > > Jihoon > > > > On Tue, Mar 19, 2019 at 3:54 PM Samarth Jain <sama...@apache.org> wrote: > > > > > Hi, > > > > > > T-Digest (https://github.com/tdunning/t-digest) data-structure is > > another > > > way of computing sketches, rank based statistics and trimmed means over > > > numeric data. At my day job, we have been using a t-digest backed Druid > > > aggregator module which generally has been working out well for the use > > > cases of respective teams. I think it would be valuable to have > T-Digest > > > backed aggregators in Druid along with other sketch algorithms like > > moments > > > and yahoo quantile sketches. > > > > > > T-Digest has also been adopted by other projects including: > > > > > > Elastic Search - > > > > > > > > > https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations-metrics-percentile-aggregation.html#search-aggregations-metrics-percentile-aggregation-approximation > > > > > > stream-lib ( > > > > > > > > > https://github.com/addthis/stream-lib/blob/master/src/main/java/com/clearspring/analytics/stream/quantile/TDigest.java > > > ) > > > > > > Apache Mahout - > > > > > > > > > https://archive.cloudera.com/cdh5/cdh/5/mahout/mahout-math/org/apache/mahout/math/stats/TDigest.html > > > > > > I have been working on cleaning up and improving performance of the > > module > > > and would like to contribute it. I would like to see what does the > > > community think about it. > > > > > > Thanks, > > > Samarth > > > > > >