nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Armel Nene (JIRA)" <>
Subject [jira] Commented: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content
Date Sun, 12 Nov 2006 11:46:40 GMT
    [ ] 
Armel Nene commented on NUTCH-61:

Has this patch by any chance been included in the newer release of nucth or is any one using
as Otis asked. The reason is I am about to build a similar patch but if this patch is already
working, I can just adapt it to my context. Or will nutch in the future planning to provide
this feature out of the box? 

> Adaptive re-fetch interval. Detecting umodified content
> -------------------------------------------------------
>                 Key: NUTCH-61
>                 URL:
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>            Reporter: Andrzej Bialecki 
>         Assigned To: Andrzej Bialecki 
>         Attachments: 20050606.diff, 20051230.txt, 20060227.txt, nutch-61-417287.patch
> Currently Nutch doesn't adjust automatically its re-fetch period, no matter if individual
pages change seldom or frequently. The goal of these changes is to extend the current codebase
to support various possible adjustments to re-fetch times and intervals, and specifically
a re-fetch schedule which tries to adapt the period between consecutive fetches to the period
of content changes.
> Also, these patches implement checking if the content has changed since last fetching;
protocol plugins are also changed to make use of this information, so that if content is unmodified
it doesn't have to be fetched and processed.

This message is automatically generated by JIRA.
If you think it was sent incorrectly contact one of the administrators:
For more information on JIRA, see:


View raw message