nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2190) Protocol normalizer
Date Tue, 12 Jan 2016 10:59:40 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093734#comment-15093734
] 

Hudson commented on NUTCH-2190:
-------------------------------

SUCCESS: Integrated in Nutch-trunk #3334 (See [https://builds.apache.org/job/Nutch-trunk/3334/])
NUTCH-2190 Protocol normalizer (markus: [http://svn.apache.org/viewvc/nutch/trunk/?view=rev&rev=1724199])
* trunk/conf/protocols.txt


> Protocol normalizer
> -------------------
>
>                 Key: NUTCH-2190
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2190
>             Project: Nutch
>          Issue Type: New Feature
>          Components: crawldb
>    Affects Versions: 1.11
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 1.12
>
>         Attachments: NUTCH-2190.patch, NUTCH-2190.patch
>
>
> URL normalizer to normalize protocols for specified hosts/domains, e.g. normalizing http://www.apache.org/
to https://www.apache.org/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message