lucene-java-user mailing list archives

From Amir Hossein Jadidinejad <>
Subject Re: Controlled Indexing -New Feature
Date Sat, 29 Nov 2008 07:03:10 GMT
Yes, it's good to filter words, but suppose that my controlled vocabulary also contains
synonyms and so on. Essentially, I want to supervise the indexing process.

Thank you very much.
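
To make the synonym requirement concrete, here is a minimal, library-free sketch in plain Java. It assumes the controlled vocabulary is available as a map from each surface form to one canonical term; the class name, method name, and sample words are hypothetical and are not part of Lucene. Tokens outside the vocabulary are dropped, and synonyms collapse to the same index term.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class SynonymVocabulary {

    /**
     * Maps every token to its canonical vocabulary term and drops tokens
     * that are not in the vocabulary at all. Keys of the map are assumed
     * to be lower-case surface forms (an assumption of this sketch).
     */
    static List<String> toCanonical(List<String> tokens,
                                    Map<String, String> canonicalBySurfaceForm) {
        List<String> indexed = new ArrayList<>();
        for (String token : tokens) {
            String canonical = canonicalBySurfaceForm.get(token.toLowerCase(Locale.ROOT));
            if (canonical != null) {
                indexed.add(canonical);
            }
        }
        return indexed;
    }

    public static void main(String[] args) {
        // Hypothetical vocabulary: two synonyms share the canonical term "car".
        Map<String, String> vocabulary = new HashMap<>();
        vocabulary.put("car", "car");
        vocabulary.put("automobile", "car");

        List<String> tokens = Arrays.asList("An", "automobile", "and", "a", "Car");
        System.out.println(toCanonical(tokens, vocabulary)); // prints [car, car]
    }
}
```

In a real analyzer this mapping would sit inside the token stream, so that the index and the queries both see only canonical terms.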

From: German Kondolf <>
Sent: Friday, November 28, 2008 7:38:34 PM
Subject: Re: Controlled Indexing -New Feature

You could use a "reverse" stop-word filter.

The plain "StopFilter" removes the keywords that match a given Set of
words; you could reverse that logic and remove ALL keywords that don't
match that Set.

Take a look at StopFilter and StandardAnalyzer ;)
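
The "reverse" logic described above can be sketched without any Lucene dependency. This is only an illustration of the keep-only idea, not Lucene's API: the class and method names are hypothetical, and in an actual analyzer the same check would live in a custom TokenFilter modelled on StopFilter.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class KeepOnlyVocabulary {

    /**
     * Inverse of a stop-word filter: keeps a token only if it IS in the
     * given set, preserving the original token order. Tokens are
     * lower-cased before the lookup, so the set is assumed to hold
     * lower-case words (an assumption of this sketch).
     */
    static List<String> keepOnly(List<String> tokens, Set<String> vocabulary) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            String normalized = token.toLowerCase(Locale.ROOT);
            if (vocabulary.contains(normalized)) {
                kept.add(normalized);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Hypothetical controlled vocabulary of single words.
        Set<String> vocabulary = new HashSet<>(Arrays.asList("lucene", "index", "analyzer"));
        List<String> tokens = Arrays.asList("The", "Lucene", "analyzer", "builds", "an", "index");
        System.out.println(keepOnly(tokens, vocabulary)); // prints [lucene, analyzer, index]
    }
}
```

Note that this checks one token at a time. Multi-word phrases from the vocabulary would need extra handling, for example joining adjacent tokens before the lookup.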

On Fri, Nov 28, 2008 at 4:38 PM, Amir Hossein Jadidinejad <> wrote:

> Hi,
> I'd like to index some documents using only known phrases. Let me explain:
> Suppose that I have a controlled vocabulary (a list of known
> phrases/words). I intend to index ONLY these phrases within my documents
> and to have a retrieval model (for example, simple VS-TF.IDF). In this
> way, each index item is one of the predefined phrases.
> Is it possible to handle this situation with Lucene?
> What is the best way?!
> Greatly appreciate any comments or answers.
> Kind regards,
