lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Neeb <>
Subject Re: solr with hadoop
Date Tue, 22 Jun 2010 15:40:03 GMT


We currently have a master-slave setup for solr with two slave servers. We
are using Solrj (stream-update-solr-server) to index master slave, which
takes 6 hours to index around 15 million documents.

I would like to explore hadoop, in particularly for indexing job using
mapreduce approach. 

- I have read some comments on the JIRA tickets, but it still seems unclear
how this setup will work. 
- I am not sure as what tasks will be done at map phase and what on reduce
- And would it merge the multiple indices together into one during reduce
phase or is this a separate task out of mapreduce?

Any directions and guidance over this setup would be highly appreciated.

Thanks in advance,
View this message in context:
Sent from the Solr - User mailing list archive at

View raw message