nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Marcos Bori (JIRA)" <j...@apache.org>
Subject [jira] [Created] (NUTCH-2413) When fetching and parsing together, parameter "parse.filter.urls" is ignored
Date Fri, 25 Aug 2017 12:16:00 GMT
Marcos Bori created NUTCH-2413:
----------------------------------

             Summary: When fetching and parsing together, parameter "parse.filter.urls" is
ignored
                 Key: NUTCH-2413
                 URL: https://issues.apache.org/jira/browse/NUTCH-2413
             Project: Nutch
          Issue Type: Bug
          Components: fetcher, parser
         Environment: Apache Nutch release 1.13.
            Reporter: Marcos Bori


In a situation when we want to:
(1) Execute the fetch and parse together ("fetcher.parse" setting to "true")
(2) Avoid applying the URL filters when executing this phase.

Condition (2) can be configured when parsing is executed as a separate process by setting
"parse.filter.urls" to "false".
However, this setting ("parse.filter.urls") is ignored when we execute the fetch and parse
phases together. 




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message