lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tanguy Moal (JIRA)" <>
Subject [jira] Commented: (SOLR-2207) Add paging capabilities to Result Grouping functionality
Date Thu, 27 Jan 2011 16:47:43 GMT


Tanguy Moal commented on SOLR-2207:

Hello Yonik,

First, thank you for this patch, that is interesting.

However I am not interested in paginating within a group, but in knowing where will the overall
pagination stop.

I don't know if I'm doing things right, but here is my issue : 
I have some kind of a signature that I store in a field and then use it to collapse search
results within groups. Some kind of duplicate content filtering.

I can't find a clean way to obtain the total number of groups for a given query. The only
way to do that from my understanding, is to ask solr for faceting on the field holding each
document's signature, with facet.limit = -1, and then count the number of distinct signature
for the given query. I finally use very simple arithmetic (matches - count(signature facets))
to know the total number of groups .
Using this, I could finally know where to stop pagination of my results set. Obviously, that
is very suboptimal.

There *must* be a cleaner way to do so, right ?

Any help appreciated =D

> Add paging capabilities to Result Grouping functionality
> --------------------------------------------------------
>                 Key: SOLR-2207
>                 URL:
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>            Assignee: Yonik Seeley
>            Priority: Minor
>             Fix For: 4.0
>         Attachments: SOLR-2207.patch
> Would be great to have the following:
> # Paging of docs within a group via start and offset parameters such that one can drill
down into a group
> # While I know we can't return a total count of all groups, it would be nice to know
if there are more groups.  This could simply be done by asking for 1 more group than requested
by the user.  If that group is non-empty, then set a flag indicating there are more groups.
 Then, start and offset could also be applied to allow paging down the groups.  We should
also think about how we can optimize this so that we can page very deep into the results without
needing to increase the PQ size.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message