lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Rohit Kumar <rohit.kku...@gmail.com>
Subject Searching in stopwords
Date Sun, 28 Jul 2013 01:59:06 GMT
I have a company search which uses stopwords during quezary time. In my
stopwords list i have entries like :

HR
Club
India
Pvt.
Ltd.
....


So if i search for companies like HR Club i get no results. Similarly
search for India HR giving no results. How can i get results in query for
following companies :

1. HR India
2. HR Club
3. HR India Pvt Ltd


I would still want to maintain the above list of stopwords since these
letters occur heavily in company text.

Please guide if i need to change my strategy itself.

<field name="company" type="text_lowercase_whitespace" indexed="true"
stored="true" />



<fieldType name="text_lowercase_whitespace" class="solr.TextField"
positionIncrementGap="100">
           <analyzer type="index">
         	<tokenizer class="solr.WhitespaceTokenizerFactory"/>
     	   	<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords.txt" enablePositionIncrements="true" />
        	<filter class="solr.LowerCaseFilterFactory"/>
		<filter class="solr.PorterStemFilterFactory"/>
   		<filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
           </analyzer>
 	   <analyzer type="query">
         	 <tokenizer class="solr.StandardTokenizerFactory"/>
                <filter class="solr.PorterStemFilterFactory"/>
     	   	<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords.txt" enablePositionIncrements="true" />
        	<filter class="solr.LowerCaseFilterFactory"/>
   		<filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
           </analyzer>
    </fieldType>



Thanks
Rohit Kumar

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message