nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ferdy Galema (JIRA)" <>
Subject [jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling
Date Wed, 04 Jul 2012 13:08:34 GMT


Ferdy Galema commented on NUTCH-1360:

Sorry for the late response, but this issue is not properly implemented (for both branch and

- IP is always stored instead of depending on property: headers.set("_ip",... should be done
only if http.getIP_Header() is true.

- appends the _ip:<true or false> property to the request string?
What is the purpose of that? If not intentional, we should simply revert this. On top of that
it uses the property with a default of "true", but is should be "false" if the adding to request
string is intentional.


> Suport the storing of IP address connected to when web crawling
> ---------------------------------------------------------------
>                 Key: NUTCH-1360
>                 URL:
>             Project: Nutch
>          Issue Type: New Feature
>          Components: protocol
>    Affects Versions: nutchgora, 1.5
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: nutchgora, 1.6
>         Attachments: NUTCH-1360-nutchgora-v2.patch, NUTCH-1360-nutchgora.patch, NUTCH-1360-trunk.patch
> Simple issue enabling us to capture the specific IP address of the host which we connect
to to fetch a page.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


View raw message