lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steven Bower (JIRA)" <>
Subject [jira] [Commented] (SOLR-5302) Analytics Component
Date Tue, 15 Apr 2014 16:48:15 GMT


Steven Bower commented on SOLR-5302:

If someone wants to do that work that's great.. I don't have plans to work on multi-shard
at the moment (this will change in the future) as I just don't have a use-case for it... we
will though.. If someone wants to pick it up I'd gladly assist...

I understand the intention to have everything cloud compatible.. The reality is that many
components suffer from inconsistencies when in cloud mode (MLT, All the join work being done
and in Solr, FieldCollapsing, etc..) I think it should be the intention to make things work
in cloud mode however some use-case don't really make sense in distributed mode when you look
at the cost of the implementation.. we can do analytics very quickly in solr with this component
but doing this as a map-reduce/distributed implementation may prove to be prohibitively time
consuming at query time and thus may not ever get used in distributed configurations..

Anyway I'd like to see this get in prior to supporting multi-node as it will probably be a
long while before the infrastructure is in place to support it (ie the map-reduce ticket)

> Analytics Component
> -------------------
>                 Key: SOLR-5302
>                 URL:
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Steven Bower
>            Assignee: Erick Erickson
>             Fix For: 5.0
>         Attachments: SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch,
Search Analytics Component.pdf, Statistical Expressions.pdf, solr_analytics-2013.10.04-2.patch
> This ticket is to track a "replacement" for the StatsComponent. The AnalyticsComponent
supports the following features:
> * All functionality of StatsComponent (SOLR-4499)
> * Field Faceting (SOLR-3435)
> ** Support for limit
> ** Sorting (bucket name or any stat in the bucket
> ** Support for offset
> * Range Faceting
> ** Supports all options of standard range faceting
> * Query Faceting (SOLR-2925)
> * Ability to use overall/field facet statistics as input to range/query faceting (ie
calc min/max date and then facet over that range
> * Support for more complex aggregate/mapping operations (SOLR-1622)
> ** Aggregations: min, max, sum, sum-of-square, count, missing, stddev, mean, median,
> ** Operations: negation, abs, add, multiply, divide, power, log, date math, string reversal,
string concat
> ** Easily pluggable framework to add additional operations
> * New / cleaner output format
> Outstanding Issues:
> * Multi-value field support for stats (supported for faceting)
> * Multi-shard support (may not be possible for some operations, eg median)

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message