nutch-dev mailing list archives

From Fırat KÜÇÜK (JIRA) <j...@apache.org>
Subject [jira] [Comment Edited] (NUTCH-1531) URL filtering takes long time for very long URLs
Date Wed, 13 Feb 2013 08:28:12 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-1531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13577405#comment-13577405 ]

Fırat KÜÇÜK edited comment on NUTCH-1531 at 2/13/13 8:26 AM:
-------------------------------------------------------------

patch attached
                
      was (Author: firatkucuk):
    patch
                  
> URL filtering takes long time for very long URLs
> ------------------------------------------------
>
>                 Key: NUTCH-1531
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1531
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.6, 2.1, 1.7, 2.2
>            Reporter: Fırat KÜÇÜK
>            Priority: Minor
>             Fix For: 1.7, 2.2
>
>         Attachments: max_url_length.diff, test_case.txt
>
>
> Some very long URLs (such as those produced by base64 image generators) take a very long
> time (hours) to filter, so a URL length limit is needed. During the reduce phase this
> locks up the whole system for hours.
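
For illustration only, a minimal sketch of the kind of length check the description asks for. This is not the contents of max_url_length.diff; the class name MaxLengthUrlFilter and the maxLength parameter are hypothetical. It assumes the usual URL filter convention of returning the URL when it is accepted and null when it is rejected, so an over-long URL can be dropped cheaply before any expensive pattern matching runs.

```java
// Hypothetical sketch: not the attached patch and not an existing Nutch class.
final class MaxLengthUrlFilter {

    // Maximum accepted URL length in characters (assumed to be configurable).
    private final int maxLength;

    MaxLengthUrlFilter(int maxLength) {
        if (maxLength <= 0) {
            throw new IllegalArgumentException("maxLength must be positive");
        }
        this.maxLength = maxLength;
    }

    /**
     * Returns the URL unchanged if it is within the limit,
     * or null if it is missing or longer than maxLength.
     */
    String filter(String url) {
        if (url == null || url.length() > maxLength) {
            return null; // rejected: skip any further, costlier filtering
        }
        return url;
    }
}
```

Running a check like this before the other filters keeps the cost of rejecting a pathological URL constant, regardless of how the later filters behave on long input.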

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
