nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lewis John McGibbney (JIRA)" <>
Subject [jira] [Updated] (NUTCH-208) http: proxy exception list:
Date Thu, 21 May 2015 19:18:17 GMT


Lewis John McGibbney updated NUTCH-208:
    Attachment: NUTCH-208v2.patch

Patch for trunk. We are working with proxies right now and this issue is helpful. The correspondence
on the thread states that a Unit test would be nice, however bringing an embedded proxy into
Nutch unit testing is a project in itself. If anyone disagrees then I can go back to make
efforts to implement it. 

> http: proxy exception list:
> ---------------------------
>                 Key: NUTCH-208
>                 URL:
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>    Affects Versions: 0.8, 1.3, nutchgora
>            Reporter: Matthias G√ľnter
>            Assignee: Lewis John McGibbney
>            Priority: Trivial
>              Labels: patch
>             Fix For: 1.11
>         Attachments: NUTCH-208-2.x.patch, NUTCH-208-branch-1.4-20110210-v3.patch, NUTCH-208-branch-1.4-20110807.patch,
NUTCH-208-branch-1.4-20110809-v2.patch, NUTCH-208-trunk-2.0-20110810-v2.patch, NUTCH-208-trunk-2.0-20110810.patch,
NUTCH-208.patch, NUTCH-208v2.patch, patch.txt, patch.txt, proxy_exception_list-0.8.diff
> I suggest that a parameter is added to nutch-default.xml which allows to generate a proxy
exception list. 
> <property>
>   <name>http.proxy.exception.list</name>
>   <value></value>
>   <description>URL's and hosts that don't use the proxy (e.g. intranets)</description>
> </property>
> This is useful when scanning intranet/internet combinations from behind a firewall. A
preliminary patch is added to this extend to this request, showing the changes. We will test
it and update it if necessary. this also reflects the reality in web browsers, where there
is in most cases an exception list.

This message was sent by Atlassian JIRA

View raw message