nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ferdy Galema (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling
Date Wed, 04 Jul 2012 13:08:34 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13406495#comment-13406495
] 

Ferdy Galema commented on NUTCH-1360:
-------------------------------------

Sorry for the late response, but this issue is not properly implemented (for both branch and
trunk).

- IP is always stored instead of depending on property: headers.set("_ip",... should be done
only if http.getIP_Header() is true.

- http.store.ip.address appends the _ip:<true or false> property to the request string?
What is the purpose of that? If not intentional, we should simply revert this. On top of that
it uses the property with a default of "true", but is should be "false" if the adding to request
string is intentional.

Thanks.

                
> Suport the storing of IP address connected to when web crawling
> ---------------------------------------------------------------
>
>                 Key: NUTCH-1360
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1360
>             Project: Nutch
>          Issue Type: New Feature
>          Components: protocol
>    Affects Versions: nutchgora, 1.5
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: nutchgora, 1.6
>
>         Attachments: NUTCH-1360-nutchgora-v2.patch, NUTCH-1360-nutchgora.patch, NUTCH-1360-trunk.patch
>
>
> Simple issue enabling us to capture the specific IP address of the host which we connect
to to fetch a page.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message