[ 
https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969202#comment-13969202
 ] 

Steven Bower commented on SOLR-5302:
------------------------------------

[~gsingers] I agree that the ideal should be to have everything work in 
distributed mode (makes things way less confusing for people). However, 
substantial work would be needed to make this functionality work in a 
multi-shard environment. Some of the stats are not easily computed without 
knowing all the values in one place (e.g. median/percentiles), so we'd 
essentially need a generic distributed map-reduce implementation that could 
run inside a query. +1 for that...
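
To illustrate the point about median/percentiles, here is a minimal sketch in 
plain Python (not Solr code; the shard data and the helper names 
shard_partial and merge_mean are made up for the example). It shows that a 
mean can be rebuilt exactly from small per-shard summaries, while combining 
per-shard medians does not generally give the true median.

```python
from statistics import median


def shard_partial(values):
    """Small per-shard summary that could be shipped to a coordinator (hypothetical)."""
    return {"sum": sum(values), "count": len(values), "median": median(values)}


def merge_mean(partials):
    """The mean merges exactly from per-shard (sum, count) pairs."""
    total = sum(p["sum"] for p in partials)
    count = sum(p["count"] for p in partials)
    return total / count


# Two made-up shards holding values of the same numeric field.
shards = [[1, 2, 3], [4, 5, 6, 7, 100]]
partials = [shard_partial(s) for s in shards]
all_values = [v for s in shards for v in s]

merged_mean = merge_mean(partials)
true_mean = sum(all_values) / len(all_values)

# Naive attempt: take the median of the per-shard medians.
median_of_medians = median(p["median"] for p in partials)
true_median = median(all_values)

print("mean   merged:", merged_mean, " exact:", true_mean)
print("median merged:", median_of_medians, " exact:", true_median)
```

With this data the merged mean matches the exact mean, but the median of the 
per-shard medians differs from the median over all values. Getting an exact 
median would need the full set of values (or an additional multi-pass or 
approximate scheme) at the merge point.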

I believe that there is substantial value in what exists in this patch and that 
we should continue working to design/implement multi-shard support for 
analytics in the future.

> Analytics Component
> -------------------
>
>                 Key: SOLR-5302
>                 URL: https://issues.apache.org/jira/browse/SOLR-5302
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Steven Bower
>            Assignee: Erick Erickson
>             Fix For: 5.0
>
>         Attachments: SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch, 
> SOLR-5302.patch, Search Analytics Component.pdf, Statistical Expressions.pdf, 
> solr_analytics-2013.10.04-2.patch
>
>
> This ticket is to track a "replacement" for the StatsComponent. The 
> AnalyticsComponent supports the following features:
> * All functionality of StatsComponent (SOLR-4499)
> * Field Faceting (SOLR-3435)
> ** Support for limit
> ** Sorting (bucket name or any stat in the bucket)
> ** Support for offset
> * Range Faceting
> ** Supports all options of standard range faceting
> * Query Faceting (SOLR-2925)
> * Ability to use overall/field facet statistics as input to range/query 
> faceting (i.e. calc min/max date and then facet over that range)
> * Support for more complex aggregate/mapping operations (SOLR-1622)
> ** Aggregations: min, max, sum, sum-of-square, count, missing, stddev, mean, 
> median, percentiles
> ** Operations: negation, abs, add, multiply, divide, power, log, date math, 
> string reversal, string concat
> ** Easily pluggable framework to add additional operations
> * New / cleaner output format
> Outstanding Issues:
> * Multi-value field support for stats (supported for faceting)
> * Multi-shard support (may not be possible for some operations, e.g. median)



--
This message was sent by Atlassian JIRA
(v6.2#6252)

