nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sebastian Nagel (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2055) Random Crawl Delay
Date Thu, 02 Jul 2015 14:39:05 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14612022#comment-14612022
] 

Sebastian Nagel commented on NUTCH-2055:
----------------------------------------

looks good (not tested yet)
* the description of the new property needs to be updated so that it describes the current
behavior
* the patch includes a change in .../storage/Host.java, probably by mistake

> Random Crawl Delay
> ------------------
>
>                 Key: NUTCH-2055
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2055
>             Project: Nutch
>          Issue Type: New Feature
>    Affects Versions: 2.3
>            Reporter: Talat UYARER
>            Priority: Trivial
>             Fix For: 2.4
>
>         Attachments: NUTCH-2055.patch
>
>
> Some Firewalls can block that request with same delay time. I create a patch for random
crawl delay between 0 and max Crawl Delay settings.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message