lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gena Batsyan <gbat...@gmail.com>
Subject indexing/crawling HTML + solr
Date Wed, 03 Jun 2009 10:09:36 GMT
Hi!

to be short, where to start with the subject?

Any pointers to some [semi-]functional solutions that crawl the web as a 
normal crawler, take care about html parsing, etc, and feed the crawled 
stuff as solr-documents per <add>  ?

regards!



Mime
View raw message