nutch-dev mailing list archives

From "Stefan Groschupf (JIRA)" <j...@apache.org>
Subject [jira] Created: (NUTCH-2) UpdateDatabaseTool ignores url-filters
Date Sun, 20 Feb 2005 22:06:48 GMT
UpdateDatabaseTool ignores url-filters
--------------------------------------

         Key: NUTCH-2
         URL: http://issues.apache.org/jira/browse/NUTCH-2
     Project: Nutch
        Type: Bug
    Reporter: Stefan Groschupf


UpdateDatabaseTool ignores url-filters

Under some circumstances the UpdateDatabaseTool does not
check the url-filters, so the webdb can grow with
unwanted urls, which can then become part of the next fetchlist.

The patch below changes the process so that the filters
are checked before anything else. Unwanted urls no longer
become part of the webdb.
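
To illustrate the filter-first idea described above, here is a minimal
sketch. It is not the patch itself and does not use Nutch classes: the
UrlFilter interface, the denyPatterns helper, the update method and the
in-memory set standing in for the webdb are all hypothetical names made up
for this example. The only point it shows is the ordering: every discovered
url goes through the filters before it can be added.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Pattern;

// Hypothetical sketch, not Nutch code: all names below are illustrative.
class FilterFirstUpdate {

    /** Assumed filter contract: return the url to keep it, or null to reject it. */
    interface UrlFilter {
        String filter(String url);
    }

    /** Builds a filter that rejects any url matching one of the deny patterns. */
    static UrlFilter denyPatterns(List<Pattern> deny) {
        return url -> {
            for (Pattern p : deny) {
                if (p.matcher(url).find()) {
                    return null; // rejected
                }
            }
            return url; // accepted unchanged
        };
    }

    /**
     * Returns a copy of db with the discovered urls added. The filters run
     * before anything else, so a rejected url never reaches the set.
     */
    static Set<String> update(Set<String> db, List<String> discovered,
                              List<UrlFilter> filters) {
        Set<String> result = new LinkedHashSet<>(db);
        for (String url : discovered) {
            String current = url;
            for (UrlFilter f : filters) {
                current = f.filter(current);
                if (current == null) {
                    break; // one rejection is enough
                }
            }
            if (current != null) {
                result.add(current);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Example deny rule (an assumption for this demo): skip image urls.
        List<UrlFilter> filters = Arrays.asList(
            denyPatterns(Arrays.asList(Pattern.compile("\\.(gif|jpg|png)$"))));
        Set<String> db = new LinkedHashSet<>(Arrays.asList("http://example.org/"));
        List<String> discovered = Arrays.asList(
            "http://example.org/page.html",
            "http://example.org/logo.gif");
        System.out.println(update(db, discovered, filters));
    }
}
```

In this sketch the image url is dropped before the add step, so it can never
end up in a later fetchlist built from the set.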


-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
   http://www.atlassian.com/software/jira

