lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <>
Subject Re: Applying term frequency thresholds on indexing time
Date Mon, 24 May 2010 23:59:54 GMT
Why do you want to calculate this? This is done for
you by the indexing process and taken into account
when searching.

You're asking for a solution before defining the problem,
which makes it very hard to help.


On Mon, May 24, 2010 at 7:25 AM, Xaida <> wrote:

> Hi guys!
> does there exist a way to define some threshold on the terms I wanna store
> in the index(before they are indexed). I need to store the terms  with
> higheest frequencies. I done it with term vectors and some cutoff ratio
> that
> cuts off the least occuring terms, but all this is, ofcourse works during
> retrieval time, reading from index.
> I know it make no sense to be able to calculate frequencies of the terms
> before they are stored, but i guess there could be some way to work it
> around???
> All hellp appreciated!
> Thank you!
> --
> View this message in context:
> Sent from the Lucene - Java Users mailing list archive at
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message