lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris Miller" <>
Subject Re: commercial websites powered by Lucene?
Date Tue, 24 Jun 2003 11:27:43 GMT
Hmm, good point with the cost of copying indicies in a distributed
environment, although that is unlikely to affect us in the foreseeable
future. But, noted!

Do you have any rough statistics on how many documents you index/day, or how
many every 20 minutes?

This discussion is fantastic by the way, lots of great experience and
comments coming out here. Thanks, it's really appreciated.

"Nader S. Henein" <> wrote in message
> We thought of that in the beginning and then we became more comfortable
> with multiple indices for simple backup purposes, and now our indices
> are in excess of 100megs, and transferring that kind of data between
> three machines sitting in the same data center is passable, but once you
> start thinking of distributed webservers in different hosting
> facilities, copying  100Megs every 20 minutes, or even every hour
> becomes financially expensive.
> Our webservers are on Single Processor Sun Ultra Sparc III 400 Mhz with
> two gegs of memory, and I've never seen the CPU usage go over 0.8 at
> peek time with the indexer running. Try it out first, take your time to
> gather your own numbers so you can really get  a feel of what set up
> fits you best.
> Nader

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message