lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Blargy <zman...@hotmail.com>
Subject Re: anyone use hadoop+solr?
Date Tue, 22 Jun 2010 20:37:38 GMT


Muneeb Ali wrote:
> 
> Hi Blargy,
> 
> Nice to hear that I am not alone ;) 
> 
> Well we have been using Hadoop for other data-intensive services, those
> that can be done in parallel. We have multiple nodes, which are used by
> Hadoop for all our MapReduce jobs. I personally don't have much experience
> with its use and hence wouldn't be able to help you much with that.
> 
> Our indexing takes 6+ hours to index 15 million documents (using
> solrj.streamUpdateSolrServer). I wanted to explore hadoop for this task,
> as it can be done in parallel.
> 
> I have just started investigating into this, will keep this post updated
> if found anything helpful.
>  
> -Neeb 
> 

Would you mind explaining how your full indexing strategy is implemented
using the StreamingUpdateSolrServer? I am currently only familar with using
the DataImportHandler. Thanks.
-- 
View this message in context: http://lucene.472066.n3.nabble.com/anyone-use-hadoop-solr-tp485333p915227.html
Sent from the Solr - User mailing list archive at Nabble.com.

Mime
View raw message