lucene-solr-user mailing list archives

From Vasu Y <vya...@gmail.com>
Subject SOLR Sizing
Date Mon, 03 Oct 2016 18:08:47 GMT
Hi,
I am trying to estimate the disk space required for documents indexed
in SOLR.
I went through the LucidWorks blog post (
https://lucidworks.com/blog/2011/09/14/estimating-memory-and-storage-for-lucenesolr/)
and am using it as a template. I have a question about estimating
"Avg. Document Size (KB)".

When calculating disk storage requirements, can we use the Java primitive
type sizes (
https://docs.oracle.com/javase/tutorial/java/nutsandbolts/datatypes.html)
to come up with an average document size?

Please let me know if the following assumptions are correct.

 Data Type          Size
 -----------------  ----------------------------------------------------
 long               8 bytes
 tint               4 bytes
 tdate              8 bytes (stored as long?)
 string             1 byte per char for ASCII chars and 2 bytes per char
                    for non-ASCII (double-byte) chars
 text               1 byte per char for ASCII chars and 2 bytes per char
                    for non-ASCII (double-byte) chars (for both with and
                    without norms?)
 ICUCollationField  2 bytes per char for non-ASCII (double-byte) chars
 boolean            1 bit?
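
To make the question concrete, here is a rough sketch of the arithmetic I have in mind. It simply sums the per-type sizes from the table above, so it is only as good as those assumptions, which are exactly what I am asking about. The field names, the sample document, and the helper functions are hypothetical, and the sketch ignores anything Solr/Lucene adds or saves on top of the raw values (index structures, compression, and so on).

```python
# Fixed-width sizes in bytes, taken from the (unconfirmed) table above.
FIXED_SIZES = {"long": 8, "tint": 4, "tdate": 8}


def string_bytes(text):
    """Assumed size of a string/text value:
    1 byte per ASCII char, 2 bytes per non-ASCII char."""
    return sum(1 if ord(ch) < 128 else 2 for ch in text)


def estimate_doc_bytes(doc, schema):
    """Sum the assumed raw size of every field in one document."""
    total = 0
    for field, value in doc.items():
        field_type = schema[field]
        if field_type in FIXED_SIZES:
            total += FIXED_SIZES[field_type]
        elif field_type in ("string", "text"):
            total += string_bytes(value)
        else:
            raise ValueError("no size assumption for type: " + field_type)
    return total


# Hypothetical schema and sample document, for illustration only.
schema = {
    "id": "long",
    "views": "tint",
    "created": "tdate",
    "title": "string",
    "body": "text",
}
doc = {
    "id": 1,
    "views": 10,
    "created": "2016-10-03T18:08:47Z",
    "title": "SOLR Sizing",   # 11 ASCII chars
    "body": "caf\u00e9",      # 3 ASCII chars + 1 non-ASCII char
}

doc_bytes = estimate_doc_bytes(doc, schema)
avg_doc_size_kb = doc_bytes / 1024.0
print(doc_bytes, avg_doc_size_kb)
```

Averaging estimate_doc_bytes over a representative sample of documents would then give the "Avg. Document Size (KB)" value to plug into the template.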

 Thanks,
 Vasu
