nutch-dev mailing list archives

From "lufeng (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries
Date Fri, 03 Aug 2012 07:49:06 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13427891#comment-13427891 ]

lufeng commented on NUTCH-1405:
-------------------------------

I think there is a problem in Injector.java (1363793).

The expected behaviour is:

if injectedSet == true && oldSet == true && overwrite == true
  res = injected

but the code further down is:

      if (injectedSet && !oldSet) {
        res = injected;
      } else {
        res = old;   // the overwrite option has no effect here: res is still the old entry
      }

Is this a bug?
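
To make the question concrete, here is a minimal, self-contained sketch of the selection the comment seems to expect. It is not the actual Injector reducer and not taken from any of the attached patches; the class `InjectMergeSketch`, the enum `Choice` and the method `choose` are hypothetical names used only for illustration. The three booleans mirror the flags named in the comment.

```java
public class InjectMergeSketch {

    /** Which entry is kept for a URL (hypothetical stand-in for the CrawlDatum choice). */
    enum Choice { INJECTED, OLD }

    /**
     * Assumed intent: keep the injected entry when there is no old entry,
     * or when an old entry exists and overwriting was requested.
     * Otherwise keep the old entry.
     */
    static Choice choose(boolean injectedSet, boolean oldSet, boolean overwrite) {
        if (injectedSet && (!oldSet || overwrite)) {
            return Choice.INJECTED;
        }
        return Choice.OLD;
    }

    public static void main(String[] args) {
        // The case raised in the comment: both entries present, overwrite requested.
        System.out.println(choose(true, true, true));
    }
}
```

With the condition quoted in the comment (`injectedSet && !oldSet`), the case where both entries are present always falls through to the old entry, regardless of the overwrite flag. The sketch differs only by the added `|| overwrite` term.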
                
> Allow to overwrite CrawlDatum's with injected entries
> -----------------------------------------------------
>
>                 Key: NUTCH-1405
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1405
>             Project: Nutch
>          Issue Type: Improvement
>          Components: injector
>    Affects Versions: 1.5, 1.6
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Minor
>             Fix For: 1.6
>
>         Attachments: NUTCH-1405-1.6-3.patch, NUTCH-1405-1.6-4.patch, NUTCH-1405-1.6-5.patch,
>                      NUTCH-1405-1.6-6.patch, NUTCH-1405-1.6-7.patch
>
>
> Injector's reducer does not permit overwriting existing CrawlDatum entries. It is, however,
> useful to optionally overwrite so users can reset metadata manually.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
