lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From David Spencer <>
Subject Re: NGramSpeller contribution -- Re: combining open office spellchecker with Lucene
Date Wed, 15 Sep 2004 16:53:32 GMT
Andrzej Bialecki wrote:

> Aad Nales wrote:
>> David,
>> Perhaps I misunderstand somehting so please correct me if I do. I used
>> to look for conts without
>> changing any of the default values. What I got as results did not
>> include 'const' which has quite a high frequency in your index and
> ??? how do you know that? Remember, this is an index of _Java_docs, and 
> "const" is not a Java keyword.

I added a line of output to the right column under the 'details' box. 
"const" appears 216 times in the index (out of 96k docs), thus it is 
indeed kinda rare.

>> should have a pretty low levenshtein distance. Any idea what causes this
>> behavior?

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message