lucene-solr-user mailing list archives

From Shri Kanish <shrikanis...@ymail.com>
Subject Solr spellcheck for words with quotes
Date Wed, 01 Aug 2012 18:19:04 GMT
Hi,
I use Solr as the search engine for our application. We have a title "Pandora's star". When I
send the query
http://localhost:8983/solr/select?q=pandora's star&spellcheck=true &spellcheck.collate=true
 
I get the response below:

<lst name="spellcheck">
  <lst name="suggestions">
    <lst name="pandora">
      <int name="numFound">1</int>
      <int name="startOffset">10</int>
      <int name="endOffset">17</int>
      <arr name="suggestion">
        <str>pandora's</str>
      </arr>
    </lst>
    <str name="collation">text_engb:pandora's's star</str>
  </lst>
</lst>
 
The misspelled word is reported as pandora, not pandora's, and an additional 's is appended to the
collation result. Below is my spellcheck configuration:
 

<fieldType name="textSpell" class="solr.TextField" positionIncrementGap="100" omitNorms="true">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords_selma.txt"/>
    <filter class="solr.StandardFilterFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords_selma.txt"/>
    <filter class="solr.StandardFilterFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  </analyzer>
</fieldType>
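
To illustrate how the doubled 's in the collation could arise, here is a minimal Python sketch. It assumes (this is inferred from the response, not confirmed from Solr's source) that the collation is built by splicing the suggestion into the query string at the reported startOffset and endOffset, and that the string being spliced is "text_engb:pandora's star". The function name splice_suggestion is hypothetical and used only for illustration.

```python
def splice_suggestion(query: str, start: int, end: int, suggestion: str) -> str:
    """Replace the characters query[start:end] with the suggestion.

    Assumption: this mimics how a collation might be assembled from the
    offsets in the spellcheck response; it is not Solr's actual code.
    """
    return query[:start] + suggestion + query[end:]


# Query string as it appears in the collation, with the field prefix (assumed).
query = "text_engb:pandora's star"

# Offsets and suggestion taken from the spellcheck response.
start_offset, end_offset = 10, 17
suggestion = "pandora's"

# The span covered by the offsets stops before the apostrophe.
token = query[start_offset:end_offset]
collation = splice_suggestion(query, start_offset, end_offset, suggestion)

print(token)      # pandora
print(collation)  # text_engb:pandora's's star
```

Under these assumptions, the offsets 10 to 17 cover only "pandora", so the original 's stays in the query and the suggestion brings its own, which matches the collation I am seeing.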
 
Please suggest how to fix this.
 
Thanks,
Shri