nutch-dev mailing list archives

From "Semyon Semyonov (JIRA)" <j...@apache.org>
Subject [jira] [Created] (NUTCH-2510) Crawl script modification. HostDb : generate, optional usage and description
Date Wed, 14 Feb 2018 10:33:02 GMT
Semyon Semyonov created NUTCH-2510:
--------------------------------------

             Summary: Crawl script modification. HostDb : generate, optional usage and description
                 Key: NUTCH-2510
                 URL: https://issues.apache.org/jira/browse/NUTCH-2510
             Project: Nutch
          Issue Type: Improvement
          Components: bin
    Affects Versions: 1.15
            Reporter: Semyon Semyonov
             Fix For: 1.14


The crawl script now includes a HostDb update as part of each crawl cycle, but:
1) The generate step is not given a hostdb parameter.

2) The HostDb update is not optional, so the HostDb is generated in every cycle without
the user asking for it. It should be controlled by an optional parameter.

3) The script's description should cover 1 and 2.
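
The three points above could be approached roughly as in the following sketch. It is not the actual patch: the flag names --hostdbupdate and --hostdbgenerate, the variable and function names, and the crawl directory layout are all illustrative assumptions, and the sketch only prints the commands a cycle would issue instead of running them. It also assumes that generate accepts a -hostdb argument and that an updatehostdb subcommand is available.

```shell
#!/usr/bin/env bash
# Sketch only. Flag names, variable names and directory layout are
# hypothetical and are not taken from the issue.

HOSTDB_UPDATE=false     # run the HostDb update step in each cycle?
HOSTDB_GENERATE=false   # pass the HostDb to the generate step?

# Point 2: make the HostDb steps opt-in through (hypothetical) flags.
parse_hostdb_flags() {
  local arg
  for arg in "$@"; do
    case "$arg" in
      --hostdbupdate)   HOSTDB_UPDATE=true ;;
      --hostdbgenerate) HOSTDB_GENERATE=true ;;
    esac
  done
}

# Print, rather than run, the commands one crawl cycle would issue.
plan_cycle() {
  local crawl_dir="$1"
  local generate_cmd="bin/nutch generate $crawl_dir/crawldb $crawl_dir/segments"
  # Point 1: hand the HostDb to generate only when requested
  # (assumes generate accepts -hostdb).
  if $HOSTDB_GENERATE; then
    generate_cmd="$generate_cmd -hostdb $crawl_dir/hostdb"
  fi
  echo "$generate_cmd"
  if $HOSTDB_UPDATE; then
    echo "bin/nutch updatehostdb -hostdb $crawl_dir/hostdb -crawldb $crawl_dir/crawldb"
  fi
}

# Point 3: describe both options in the usage text.
usage_hostdb() {
  echo "  --hostdbupdate     update the HostDb in every crawl cycle (off by default)"
  echo "  --hostdbgenerate   pass the HostDb to the generate step (off by default)"
}
```

With no flags, plan_cycle prints only the plain generate command, so a user who does not ask for the HostDb never pays for it.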



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
