[
https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969478#comment-13969478
]
Grant Ingersoll commented on SOLR-5302:
---------------------------------------
I don't agree. Distributed is and should be the default mode we do everything
in going forward and if we don't account for it up front, then we end up making
all kinds of compromises on it and/or it takes years to get done (just look at
MLT). I can almost guarantee you the first question on the list once this is
released is "how come it doesn't work in distributed". This is not a case of
the "perfect being the enemy of the good enough", but a case of missing the
fact that the usage of distributed is the world we live in and so this patch
only serves those going backwards and not those going forward.
It would be one thing if this issue had a plan for what can be distributed and
what can't and an approach outlined such that it _could_ be implemented sooner
rather than later, but that doesn't appear to be the case, AFAICT. For
instance, some of the stats that can't be easily distributed do have
approximations that can be.
bq. We'd essentially need a generic distributed map-reduce implementation that
could run inside a query. +1 for that.
See https://issues.apache.org/jira/browse/SOLR-5069.
> Analytics Component
> -------------------
>
> Key: SOLR-5302
> URL: https://issues.apache.org/jira/browse/SOLR-5302
> Project: Solr
> Issue Type: New Feature
> Reporter: Steven Bower
> Assignee: Erick Erickson
> Fix For: 5.0
>
> Attachments: SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch,
> SOLR-5302.patch, Search Analytics Component.pdf, Statistical Expressions.pdf,
> solr_analytics-2013.10.04-2.patch
>
>
> This ticket is to track a "replacement" for the StatsComponent. The
> AnalyticsComponent supports the following features:
> * All functionality of StatsComponent (SOLR-4499)
> * Field Faceting (SOLR-3435)
> ** Support for limit
> ** Sorting (bucket name or any stat in the bucket
> ** Support for offset
> * Range Faceting
> ** Supports all options of standard range faceting
> * Query Faceting (SOLR-2925)
> * Ability to use overall/field facet statistics as input to range/query
> faceting (ie calc min/max date and then facet over that range
> * Support for more complex aggregate/mapping operations (SOLR-1622)
> ** Aggregations: min, max, sum, sum-of-square, count, missing, stddev, mean,
> median, percentiles
> ** Operations: negation, abs, add, multiply, divide, power, log, date math,
> string reversal, string concat
> ** Easily pluggable framework to add additional operations
> * New / cleaner output format
> Outstanding Issues:
> * Multi-value field support for stats (supported for faceting)
> * Multi-shard support (may not be possible for some operations, eg median)
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]