lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <erickerick...@gmail.com>
Subject Re: solr 5.2->7.2, suggester failure
Date Tue, 03 Apr 2018 18:41:32 GMT
What kinds of things go into your title field? On first blush that's a
bit odd for a multi-word title field since it treats the entire input
as a single string. The code is trying to build a large FST to hold
all of this data. Would AnalyzingInfixLookupFactory or similar make
more sense?

buildOnStartup and buildOnOptimize are other red flags. This means
that every time you start up, the data for the title field is read
from disk and the FST is built (or index if you use a different impl).
On a large corpus this may take many minutes.

Best,
Erick

On Tue, Apr 3, 2018 at 11:28 AM, David Hastings
<hastings.recursive@gmail.com> wrote:
> Hey all, I recently got a 7.2 instance up and running, and it seems to be
> going well however, I have ran into this when creating one of my indexes,
> and was wondering if anyone had a quick idea right off the top of their
> head.
>
> solrconfig:
>
> <searchComponent name="suggest" class="solr.SuggestComponent">
>   <lst name="suggester">
>     <str name="name">fixspell</str>
>     <str name="lookupImpl">FuzzyLookupFactory</str>
>
>         <str name="suggestAnalyzerFieldType">string</str>
>
>     <str name="dictionaryImpl">DocumentDictionaryFactory</str>
>     <str name="field">title</str>
>     <str name="buildOnStartup">true</str>
>     <str name="buildOnOptimize">true</str>
>   </lst>
>
>
> received error:
>
>
> ERROR true
> SuggestComponent
> Exception in building suggester index for: fixspell
> java.lang.IllegalArgumentException: input automaton is too large: 1001
> at
> org.apache.lucene.util.automaton.Operations.topoSortStatesRecurse(Operations.java:1298)
> at
> org.apache.lucene.util.automaton.Operations.topoSortStatesRecurse(Operations.java:1306)
> at
> org.apache.lucene.util.automaton.Operations.topoSortStatesRecurse(Operations.java:1306)
>
> .....
>
> at
> org.apache.lucene.util.automaton.Operations.topoSortStatesRecurse(Operations.java:1306)
> at
> org.apache.lucene.util.automaton.Operations.topoSortStatesRecurse(Operations.java:1306)
> at
> org.apache.lucene.util.automaton.Operations.topoSortStates(Operations.java:1275)
> at
> org.apache.lucene.search.suggest.analyzing.AnalyzingSuggester.replaceSep(AnalyzingSuggester.java:292)
> at
> org.apache.lucene.search.suggest.analyzing.AnalyzingSuggester.toAutomaton(AnalyzingSuggester.java:854)
> at
> org.apache.lucene.search.suggest.analyzing.AnalyzingSuggester.build(AnalyzingSuggester.java:430)
> at org.apache.lucene.search.suggest.Lookup.build(Lookup.java:190)
> at
> org.apache.solr.spelling.suggest.SolrSuggester.build(SolrSuggester.java:181)
> at
> org.apache.solr.handler.component.SuggestComponent$SuggesterListener.buildSuggesterIndex(SuggestComponent.java:529)
> at
> org.apache.solr.handler.component.SuggestComponent$SuggesterListener.newSearcher(SuggestComponent.java:511)
> at org.apache.solr.core.SolrCore.lambda$getSearcher$17(SolrCore.java:2275)

Mime
View raw message