nutch-dev mailing list archives

From joel gump <biglaugh...@gmail.com>
Subject Re: Plugins: when to perform web service requests, on fetch or on index?
Date Thu, 18 Jun 2009 12:42:11 GMT
How about writing a standalone app that analyzes the data after the crawl and index steps are done?
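
A rough sketch of what I mean, in Python. Everything here is made up for illustration: `enrich_documents`, the `ws_data` field name, and the `lookup` callable standing in for the real web service call. It assumes the indexed pages have already been exported as dicts with a `url` key; reading them from the index and writing them back are left out.

```python
from typing import Callable, Dict, Iterable, List


def enrich_documents(
    docs: Iterable[Dict[str, str]],
    lookup: Callable[[str], str],
    field: str = "ws_data",  # hypothetical field name, not a Nutch/Solr field
) -> List[Dict[str, str]]:
    """Return copies of docs with `field` filled in from lookup(url).

    `lookup` stands in for the web service request. It is called at most
    once per distinct URL; repeated URLs are served from a local cache.
    """
    cache: Dict[str, str] = {}
    enriched: List[Dict[str, str]] = []
    for doc in docs:
        url = doc["url"]
        if url not in cache:
            cache[url] = lookup(url)  # the only place the service is hit
        # Copy the document so the input records are left untouched.
        enriched.append({**doc, field: cache[url]})
    return enriched
```

Since this runs outside the crawl, slow or failing web service calls would not hold up the fetcher or the indexer, and the pass can be rerun on its own.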

On Thu, Jun 18, 2009 at 6:57 PM, caezar<caezaris@gmail.com> wrote:
>
> Hi All,
>
> I'm writing several Nutch plugins that will send requests to some
> web services for the pages being indexed and store the retrieved data in
> the index. The question is: in terms of performance, at which stage of
> crawling is it better to perform these web service requests, at fetch
> time (in an HtmlParseFilter) or at index time (in an IndexingFilter)?
>
> Nutch version is 1.0, indexer is SolrIndexer.
>
> Thanks.
> --
> View this message in context: http://www.nabble.com/Plugins%3A-when-to-perform-web-service-requests%2C-on-fetch-or-on-index--tp24089858p24089858.html
> Sent from the Nutch - Dev mailing list archive at Nabble.com.
>
>
