lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steven Bower (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-5302) Analytics Component
Date Tue, 15 Apr 2014 04:27:27 GMT

    [ https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969202#comment-13969202
] 

Steven Bower commented on SOLR-5302:
------------------------------------

[~gsingers] I agree that the ideal should be to have everything work in distributed mode (makes
thins way less confusing for people). However substantial work would be needed to make this
functionality work in a multi-shard environment.. We'd essentially need a generic distributed
map-reduce implementation that could run inside a query. +1 for that... This is because of
some of the stats are not easily computed without knowing all the values in one place (eg
median/percentiles).

I believe that there is substantial value in what exists in this patch and that we continue
work into the future to design/implement multi-shard support for analytics. 

> Analytics Component
> -------------------
>
>                 Key: SOLR-5302
>                 URL: https://issues.apache.org/jira/browse/SOLR-5302
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Steven Bower
>            Assignee: Erick Erickson
>             Fix For: 5.0
>
>         Attachments: SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch,
Search Analytics Component.pdf, Statistical Expressions.pdf, solr_analytics-2013.10.04-2.patch
>
>
> This ticket is to track a "replacement" for the StatsComponent. The AnalyticsComponent
supports the following features:
> * All functionality of StatsComponent (SOLR-4499)
> * Field Faceting (SOLR-3435)
> ** Support for limit
> ** Sorting (bucket name or any stat in the bucket
> ** Support for offset
> * Range Faceting
> ** Supports all options of standard range faceting
> * Query Faceting (SOLR-2925)
> * Ability to use overall/field facet statistics as input to range/query faceting (ie
calc min/max date and then facet over that range
> * Support for more complex aggregate/mapping operations (SOLR-1622)
> ** Aggregations: min, max, sum, sum-of-square, count, missing, stddev, mean, median,
percentiles
> ** Operations: negation, abs, add, multiply, divide, power, log, date math, string reversal,
string concat
> ** Easily pluggable framework to add additional operations
> * New / cleaner output format
> Outstanding Issues:
> * Multi-value field support for stats (supported for faceting)
> * Multi-shard support (may not be possible for some operations, eg median)



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message