nutch-dev mailing list archives

From Doug Cutting <cutt...@nutch.org>
Subject Re: IndexOptimizer (Re: Lucene performance bottlenecks)
Date Wed, 14 Dec 2005 00:02:45 GMT
Andrzej Bialecki wrote:
> Ok, I just tested IndexSorter for now. It appears to work correctly, at 
> least I get exactly the same results, with the same scores and the same 
> explanations, if I run the same queries on the original and on the 
> sorted index.

Here's a more complete version, still mostly untested.  This should make 
searches faster.  We'll see how good the results are...

This includes a patch to Lucene to make it easier to write hit 
collectors that collect TopDocs.
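
For readers who have not written one, here is a rough sketch of what a 
top-N hit collector generally does: it sees every matching document once 
and keeps only the best few by score.  This is not the attached patch and 
does not use Lucene's classes; TopHitsCollector, Hit, collect(), and 
topHits() are hypothetical names chosen for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.PriorityQueue;

// Illustrative only: all names here are assumptions, not Lucene's API.
class TopHitsCollector {
    // One scored hit: a document number and its score.
    static final class Hit {
        final int doc;
        final float score;
        Hit(int doc, float score) { this.doc = doc; this.score = score; }
    }

    private final int limit;
    // Min-heap ordered by score, so the weakest retained hit sits at the head.
    private final PriorityQueue<Hit> heap;
    private int totalHits = 0;

    TopHitsCollector(int limit) {
        if (limit < 1) throw new IllegalArgumentException("limit must be >= 1");
        this.limit = limit;
        this.heap = new PriorityQueue<>(limit, (a, b) -> Float.compare(a.score, b.score));
    }

    // Called once per matching document, in any order.
    void collect(int doc, float score) {
        totalHits++;
        if (heap.size() < limit) {
            heap.add(new Hit(doc, score));
        } else if (score > heap.peek().score) {
            // The new hit beats the weakest retained one, so replace it.
            heap.poll();
            heap.add(new Hit(doc, score));
        }
    }

    // Number of documents seen, including those not retained.
    int totalHits() { return totalHits; }

    // Retained hits, best score first.
    List<Hit> topHits() {
        List<Hit> out = new ArrayList<>(heap);
        out.sort((a, b) -> Float.compare(b.score, a.score));
        return out;
    }
}
```

The min-heap keeps the cost per hit at O(log N) for N retained results, 
so collecting the top few from millions of matches stays cheap.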

I'll test this on a 38M document index tomorrow.

Cheers,

Doug
