lucene-java-user mailing list archives

From Erick Erickson <erickerick...@gmail.com>
Subject Re: NewBie To Lucene || Perfect configuration on a 64 bit server
Date Wed, 28 May 2014 21:29:40 GMT
bq: We did a detailed analysis of each step and observed that
indexing per RTF file (i.e. using path and content (with File Reader))
happened at the same millisecond, and
on average it took 95 ms for each file to get indexed and
anywhere between 200 and 500 ms for a file to get converted to text
using Aspose.

Do I misunderstand, or is between 2/3 and 5/6 of your time spent
acquiring the text? In which case, even if you get Lucene down to 0 ms to
index a doc (impossible of course), you'll gain at best 33%.
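
A quick sketch of that arithmetic, assuming the per-file costs simply add up (no overlap between conversion and indexing) and taking the 95 ms and 200 to 500 ms figures quoted above. The class and method names are made up for illustration:

```java
// Back-of-the-envelope check: how much of the per-file time is text
// extraction, and how much could be saved if indexing were free?
// Assumes total time per file = extraction time + indexing time.
class IndexingTimeShare {

    // Fraction of per-file time spent acquiring the text (outside Lucene).
    static double extractionShare(double extractMs, double indexMs) {
        return extractMs / (extractMs + indexMs);
    }

    // Upper bound on the overall saving if indexing cost dropped to 0 ms.
    static double maxSaving(double extractMs, double indexMs) {
        return indexMs / (extractMs + indexMs);
    }

    public static void main(String[] args) {
        double indexMs = 95.0;               // average indexing time per file
        double[] extractMs = {200.0, 500.0}; // reported conversion time range

        for (double e : extractMs) {
            System.out.printf(
                "extract=%.0f ms: extraction share=%.0f%%, max saving=%.0f%%%n",
                e,
                100.0 * extractionShare(e, indexMs),
                100.0 * maxSaving(e, indexMs));
        }
    }
}
```

With those inputs the extraction share comes out at roughly 68% and 84%, so the most that tuning Lucene alone could save is roughly 32% down to 16% of the total.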

Which is another argument for indexing all the docs and just using
filters or a TermsFilter as Arjen suggests.

Best,
Erick



On Mon, May 26, 2014 at 11:25 PM, Ralf Heyde <xoodrenalin@gmx.de> wrote:
> Hey,
>
> I have several notes about your process.
>
> 1st: How do you select the documents you are passing to the index for
> further searching? Maybe it is more straightforward to "find" them in your
> programming language?
> 2nd: Storage is cheap, buy a hard-disk and store the overall index. The most
> expensive operation is the indexing and the first read access (caching on
> Lucene / OS level). Imagine what happens when you build the index and delete
> it afterwards just for a "simple" search operation on a subset of your
> documents.
>
> Cheers, Ralf
>
>
>
> -----Original Message-----
> From: rulinma [mailto:rulinma@gmail.com]
> Sent: Tuesday, 27 May 2014 03:14
> To: java-user@lucene.apache.org
> Subject: RE: NewBie To Lucene || Perfect configuration on a 64 bit server
>
> 1000+ is Solr; Lucene is faster.
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/NewBie-To-Lucene-Perfect-configuration-on-a-64-bit-server-tp4136871p4138215.html
> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org

