lucene-dev mailing list archives

From Halácsy Péter <>
Subject Re: Relevance boosting with the aid of semantic markup
Date Mon, 10 Dec 2001 08:09:13 GMT
Doug Cutting wrote:

>>Why can't we store some value for each word? If I could index the stems
>>of the words as well, I would give them a lower value.
>>I know a Russian search engine that uses 3 (or 4, I don't remember)
>>distinct values to classify each term in the index:
>>1. original word
>>2. stem
>>3. spam
>>The priority of the terms is calculated at indexing time and used for 
>Would such weighting be per word, or per word occurrence?  Earlier you were
>asking for the ability to separately weight word occurrences, e.g. to boost
>them if they are emphasized in the text.  That was what I was responding to.
Per word occurrence (don't forget, it's only interesting if I can put more
than one word at the same position).
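
A rough sketch of what I mean, in plain Python rather than against any real
Lucene API. The weight values, the toy stemmer, and the names WEIGHTS,
naive_stem, build_index and score are all assumptions made up for
illustration. Each posting carries its own weight class, and the surface form
and its stem share one position:

```python
from collections import defaultdict

# Hypothetical weight classes, mirroring the original/stem/spam list above.
# The numeric values are illustrative assumptions, not taken from any engine.
WEIGHTS = {"original": 1.0, "stem": 0.5, "spam": 0.1}


def naive_stem(word):
    """Toy stemmer (assumption): strip a trailing 's'. Not a real algorithm."""
    return word[:-1] if len(word) > 3 and word.endswith("s") else word


def build_index(docs):
    """Map term -> list of (doc_id, position, weight_class) postings.

    The surface form and its stem are stored at the same position, so the
    weight is attached to each occurrence rather than to the term globally.
    """
    index = defaultdict(list)
    for doc_id, text in docs.items():
        for position, token in enumerate(text.lower().split()):
            index[token].append((doc_id, position, "original"))
            stem = naive_stem(token)
            if stem != token:
                index[stem].append((doc_id, position, "stem"))
    return index


def score(index, term):
    """Sum the per-occurrence weights of a term for each document."""
    totals = defaultdict(float)
    for doc_id, _position, weight_class in index.get(term, []):
        totals[doc_id] += WEIGHTS[weight_class]
    return dict(totals)


docs = {1: "engines index words", 2: "one word"}
index = build_index(docs)
```

With this, a query for "word" matches document 2 through the original term
and document 1 only through the stem stored at the position of "words", so
document 1 gets the lower weight.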

