lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steven Anderson" <>
Subject Large Data Set Suggestions
Date Wed, 05 Nov 2008 15:52:30 GMT
I've been asked to do some indexing performance testing on Solr 1.3
using large XML document data sets (10M-60M docs) with DIH versus SolrJ.

Does anyone have any suggestions where I might find a good data set this
I saw the wikipedia dump reference in the DIH wiki, but that is only in
the 7M+ doc range.
Any suggestions would be greatly appreciated.

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message