lucene-solr-user mailing list archives

From "Allison, Timothy B." <talli...@mitre.org>
Subject RE: SOLR Sizing
Date Mon, 03 Oct 2016 19:19:52 GMT
This doesn't answer your question, but Erick Erickson's blog on this topic is invaluable:

https://lucidworks.com/blog/2012/07/23/sizing-hardware-in-the-abstract-why-we-dont-have-a-definitive-answer/

-----Original Message-----
From: Vasu Y [mailto:vyal2k@gmail.com] 
Sent: Monday, October 3, 2016 2:09 PM
To: solr-user@lucene.apache.org
Subject: SOLR Sizing

Hi,
 I am trying to estimate disk space requirements for the documents indexed to SOLR.
I went through the LucidWorks blog (
https://lucidworks.com/blog/2011/09/14/estimating-memory-and-storage-for-lucenesolr/)
and am using it as a template. I have a question about estimating "Avg. Document Size
(KB)".

When calculating disk storage requirements, can we use the Java type sizes (
https://docs.oracle.com/javase/tutorial/java/nutsandbolts/datatypes.html) to come up with an
average document size?

Please let me know if the following assumptions are correct.

 Data Type            Size
 -----------------    ------
 long                 8 bytes
 tint                 4 bytes
 tdate                8 bytes (Stored as long?)
 string               1 byte per char for ASCII chars and 2 bytes per char for
                      Non-ASCII chars (Double byte chars)
 text                 1 byte per char for ASCII chars and 2 bytes per char for
                      Non-ASCII (Double byte chars) (For both with & without norm?)
 ICUCollationField    2 bytes per char for Non-ASCII (Double byte chars)
 boolean              1 bit?

 Thanks,
 Vasu
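
For illustration, the assumptions in the table above can be turned into a
back-of-the-envelope calculation of "Avg. Document Size (KB)". This is only a
sketch: the byte counts are the unconfirmed assumptions from the question, not
Solr's actual on-disk format, and the names field_bytes, avg_doc_size_kb and
sample_docs are hypothetical. Only long, tint, tdate, string and text are
covered.

```python
# Per-field sizes taken from the question's table. They are the poster's
# assumptions, NOT confirmed Solr/Lucene storage sizes.
FIXED_SIZES = {"long": 8, "tint": 4, "tdate": 8}


def field_bytes(field_type, value):
    """Estimate the raw size of one field value under the assumed rules."""
    if field_type in FIXED_SIZES:
        return FIXED_SIZES[field_type]
    if field_type in ("string", "text"):
        # Assumed rule: 1 byte per ASCII char, 2 bytes per non-ASCII char.
        return sum(1 if ord(ch) < 128 else 2 for ch in value)
    raise ValueError("no assumed size for field type: " + field_type)


def avg_doc_size_kb(docs):
    """Average estimated document size in KB over a sample of documents."""
    total = sum(field_bytes(ftype, value) for doc in docs for ftype, value in doc)
    return total / len(docs) / 1024


# Hypothetical sample: each document is a list of (field type, value) pairs.
sample_docs = [
    [("long", 1), ("tint", 7), ("tdate", 1475522392),
     ("string", "abc"), ("text", "hello world")],
    [("long", 2), ("tint", 9), ("tdate", 1475522393),
     ("string", "é"), ("text", "naïve")],
]

print(avg_doc_size_kb(sample_docs))
```

The result is a sum of raw field values only. As the blog post linked in the
reply explains, it does not by itself give a definitive index size.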