lucene-java-user mailing list archives

From James Cooper
Subject Re: sanity check - index size
Date Tue, 21 May 2002 00:18:06 GMT
On Mon, 20 May 2002, Erik Hatcher wrote:

> I'm indexing 900+ files (less than 1,000) that total about 15MB in size.
> My index (specifically _sd.fdt) is 20MB.
> Does that seem at all close to being reasonable/normal?  I am calling
> optimize() before closing the index.


I've wondered the same thing.  The indexes I build with Lucene are
generally around the same size as the corpus.  That was larger than I
thought it would be, but it doesn't really matter since disk is pretty
cheap (and my corpus isn't very big).

-- James
