nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Frédéric Passaniti <>
Subject Nutch readings for developers
Date Thu, 22 May 2014 14:34:11 GMT
Hello everyone,

I'm looking for some litterature/readings about HOW TO develop plugins in
nutch, understand very well the deep architecture of the crawler.
What are the different entry points for custom code in the crawling and
indexing process.
More particullary how to develop custom parsers and content extractors, how
to redirect the parsed content into a custom storage service etc...

If you have good blogs, sites, wikis or even git/googlecode small project
to have a look....

It would be much appreciated !!

Thank you !

Frédéric Passaniti

View raw message