nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Markus Jelsma (JIRA)" <>
Subject [jira] [Created] (NUTCH-1346) Follow outlinks to ignore external
Date Tue, 24 Apr 2012 14:03:36 GMT
Markus Jelsma created NUTCH-1346:

             Summary: Follow outlinks to ignore external
                 Key: NUTCH-1346
             Project: Nutch
          Issue Type: Improvement
          Components: fetcher
    Affects Versions: 1.5
            Reporter: Markus Jelsma
            Assignee: Markus Jelsma
             Fix For: 1.6

The follow outlinks feature already respects the db.ignore.external.links setting. However,
this means that outlinks of fetched pages that are external are not saved in parse data. There
should be a new setting to prevent the outlink follower from going external but still storing
external outlinks.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


View raw message