lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael McCandless (JIRA)" <>
Subject [jira] [Commented] (SOLR-2564) Integrating grouping module into Solr 4.0
Date Mon, 06 Jun 2011 12:44:59 GMT


Michael McCandless commented on SOLR-2564:

bq. The other use-case is more like field collapsing and does change what documents match
(basically, only the first documents in each group, up to limit, "match").

I'm not sure it's that simple, ie that we can so cleanly model
collapsing as reducing the docs to consider and then running faceting
on that reduced set.

EG, the use case of getting correct facet counts for a field that has
different values within the group, can't be handled by this approach?
This is the count=2 for size=S in my example at

I think to do that properly, the faceting impl needs to see all docs
in the group, not just the "lead doc" per group.

I think another way to visualize/model this that we really need to be
able to configure "which field counts" (ID_FIELD) for the schema.
This field would then decide all counts -- total "hit count", facet
counts, etc., ie each of these counts is count(unique(ID_FIELD)) of
the docs falling in that facet/result set.  The default is Lucene's docid,
but the app should be able to state any other ID_FIELD.

> Integrating grouping module into Solr 4.0
> -----------------------------------------
>                 Key: SOLR-2564
>                 URL:
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Martijn van Groningen
>            Assignee: Martijn van Groningen
>             Fix For: 4.0
>         Attachments: LUCENE-2564.patch, SOLR-2564.patch, SOLR-2564.patch, SOLR-2564.patch,
> Since work on grouping module is going well. I think it is time to wire this up in Solr.
> Besides the current grouping features Solr provides, Solr will then also support second
pass caching and total count based on groups.

This message is automatically generated by JIRA.
For more information on JIRA, see:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message