nutch-dev mailing list archives

From "Julien Nioche (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries
Date Wed, 04 Jul 2012 14:46:35 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13406564#comment-13406564 ]

Julien Nioche commented on NUTCH-1405:
--------------------------------------

The way I was thinking about it was that overwrite has precedence over update, i.e. update
happens only if overwrite is false. If both are true, dump a log message but do the overwrite.
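
To make the proposed precedence concrete, here is a minimal sketch of the decision only. It is not taken from any of the attached patches: the class, enum, and method names are hypothetical, and the behaviour when both flags are false (keep the existing entry) is an assumption based on the issue description.

```java
import java.util.logging.Logger;

// Hypothetical sketch of the precedence rule; not the Injector's actual code.
final class InjectPrecedenceSketch {
    private static final Logger LOG =
        Logger.getLogger(InjectPrecedenceSketch.class.getName());

    // Illustrative outcomes for an injected URL that already has an entry.
    enum Action { OVERWRITE, UPDATE, KEEP_EXISTING }

    // Overwrite wins over update; update applies only when overwrite is false.
    static Action resolve(boolean overwrite, boolean update) {
        if (overwrite) {
            if (update) {
                // Both flags set: log it, but still overwrite.
                LOG.warning("Both overwrite and update are set; overwrite takes precedence.");
            }
            return Action.OVERWRITE;
        }
        // Assumption: with neither flag set, the existing entry is left untouched.
        return update ? Action.UPDATE : Action.KEEP_EXISTING;
    }

    public static void main(String[] args) {
        // Print the outcome for each flag combination.
        boolean[] flags = {false, true};
        for (boolean overwrite : flags) {
            for (boolean update : flags) {
                System.out.println("overwrite=" + overwrite + ", update=" + update
                    + " -> " + resolve(overwrite, update));
            }
        }
    }
}
```

Keeping the decision in one small function means the "both true" case is handled in exactly one place, which is also where the log message belongs.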
                
> Allow to overwrite CrawlDatum's with injected entries
> -----------------------------------------------------
>
>                 Key: NUTCH-1405
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1405
>             Project: Nutch
>          Issue Type: Improvement
>          Components: injector
>    Affects Versions: 1.5, 1.6
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Minor
>             Fix For: 1.6
>
>         Attachments: NUTCH-1405-1.6-3.patch, NUTCH-1405-1.6-4.patch, NUTCH-1405-1.6-5.patch,
>                      NUTCH-1405-1.6-6.patch
>
>
> Injector's reducer does not permit overwriting existing CrawlDatum entries. It is, however,
> useful to optionally overwrite so users can reset metadata manually.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
