lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From PeterKerk <petervdk...@hotmail.com>
Subject Re: Flexible search field analyser/tokenizer configuration
Date Wed, 01 Oct 2014 20:32:46 GMT
Hi Erick,

Thanks for clarifying some of this :)

That triggers a few more questions:

1. I have no df" setting in my solrconfig.xml file at all, nor do I see a
<requestHandler name=&quot;/select&quot; anywhere. How would this typically
look? 
2. My site is in 2 languages, Dutch and English. So I now added the Dutch
stopwords like below to my field definition. However, I also want to exclude
English stopwords...does that mean I need to define this field definition
for each language or can I add stopwords for multiple languages in the same
field definition?

	&lt;fieldType name=&quot;searchtext&quot; class=&quot;solr.TextField&quot;
positionIncrementGap=&quot;100&quot;>
      <analyzer type="index">
		 <tokenizer class="solr.WhitespaceTokenizerFactory"/>       
		 <filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords_dutch.txt"/>		 
		 <filter class="solr.LowerCaseFilterFactory"/> 
		 <filter class="solr.EdgeNGramFilterFactory" minGramSize="2"
maxGramSize="20" side="front" /> 
      </analyzer>
      <analyzer type="query">
		 <tokenizer class="solr.WhitespaceTokenizerFactory"/>   
		<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords_dutch.txt"/>				 
		 <filter class="solr.LowerCaseFilterFactory"/> 
		 <filter class="solr.EdgeNGramFilterFactory" minGramSize="2"
maxGramSize="20" side="front" /> 
      </analyzer>
    </fieldType>

3. fq:the AND Royal AND Garden works indeed, but how would I go about to
make sure that in that query
	a. "the" is ignored
	b. "The Royal Garden" is returned as the 1st result since it's an exact
match and "Royal" as the 2nd results since it's a partial match (on
non-stopwords)?	I guess that would be via the ranking you mention, but where
to configure that for my usecase? I have seen weights on results by using
the ^ operator, e.g. &qf=title_search^20.0+province^15+city_search^10.0 but
I doubt that is the way to go here.



--
View this message in context: http://lucene.472066.n3.nabble.com/Flexible-search-field-analyser-tokenizer-configuration-tp4161624p4162200.html
Sent from the Solr - User mailing list archive at Nabble.com.

Mime
View raw message