lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Markus Jelsma (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SOLR-4280) spellcheck.maxResultsForSuggest based on filter query results
Date Tue, 16 Jul 2013 12:34:49 GMT

     [ https://issues.apache.org/jira/browse/SOLR-4280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Markus Jelsma updated SOLR-4280:
--------------------------------

    Attachment: SOLR-4280-trunk.patch

I forgot i had a working patch laying around. Specify spellcheck.percentageResultsForSuggest=0.25
to force maxResultsForSuggest to be 25% of the smallest filterQuery DocSet. This allows maxResultsForSuggest
to be adjusted dynamically based on the filters specified. 

It doesn't seem to work in a distributed environment although the parameters are passed nicely.
I haven't figured that out yet, but all shards return the same collation for undistributed
requests. Tips?
                
> spellcheck.maxResultsForSuggest based on filter query results
> -------------------------------------------------------------
>
>                 Key: SOLR-4280
>                 URL: https://issues.apache.org/jira/browse/SOLR-4280
>             Project: Solr
>          Issue Type: Improvement
>          Components: spellchecker
>            Reporter: Markus Jelsma
>             Fix For: 4.4
>
>         Attachments: SOLR-4280-trunk-1.patch, SOLR-4280-trunk.patch
>
>
> spellcheck.maxResultsForSuggest takes a fixed number but ideally should be able to take
a ratio and calculate that against the maximum number of results the filter queries return.
> At least in our case this would certainly add a lot of value. >99% of our end-users
search within one or more filters of which one is always unique. The number of documents for
each of those unique filters varies significantly ranging from 300 to 3.000.000 documents
in which they search. The maxResultsForSuggest is set to a reasonable low value so it kind
of works fine but sometimes leads to undesired suggestions for a large subcorpus that has
more misspellings.
> Spun off from SOLR-4278.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message