lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Blargy <>
Subject Re: anyone use hadoop+solr?
Date Tue, 22 Jun 2010 20:37:38 GMT

Muneeb Ali wrote:
> Hi Blargy,
> Nice to hear that I am not alone ;) 
> Well we have been using Hadoop for other data-intensive services, those
> that can be done in parallel. We have multiple nodes, which are used by
> Hadoop for all our MapReduce jobs. I personally don't have much experience
> with its use and hence wouldn't be able to help you much with that.
> Our indexing takes 6+ hours to index 15 million documents (using
> solrj.streamUpdateSolrServer). I wanted to explore hadoop for this task,
> as it can be done in parallel.
> I have just started investigating into this, will keep this post updated
> if found anything helpful.
> -Neeb 

Would you mind explaining how your full indexing strategy is implemented
using the StreamingUpdateSolrServer? I am currently only familar with using
the DataImportHandler. Thanks.
View this message in context:
Sent from the Solr - User mailing list archive at

View raw message