lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Muneeb Ali <>
Subject Re: anyone use hadoop+solr?
Date Tue, 22 Jun 2010 16:57:25 GMT

Hi Blargy,

Nice to hear that I am not alone ;) 

Well we have been using Hadoop for other data-intensive services, those that
can be done in parallel. We have multiple nodes, which are used by Hadoop
for all our MapReduce jobs. I personally don't have much experience with its
use and hence wouldn't be able to help you much with that.

Our indexing takes 6+ hours to index 15 million documents (using
solrj.streamUpdateSolrServer). I wanted to explore hadoop for this task, as
it can be done in parallel.

I have just started investigating into this, will keep this post updated if
found anything helpful.
View this message in context:
Sent from the Solr - User mailing list archive at

View raw message