nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ferdy Galema (Updated) (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (NUTCH-1289) In distributed mode URL's are not partitioned
Date Mon, 05 Mar 2012 13:01:57 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Ferdy Galema updated NUTCH-1289:
--------------------------------

    Attachment: NUTCH-1289-v2.patch

Done with patch v2. It fixes the problem as described above. It also features a minor improvement,
namely that the partition code will be skipped entirely when there is just one partition.
(For example in local mode.)

It includes several tests, including the seed function, the different modes and signature
partitioners.
                
> In distributed mode URL's are not partitioned
> ---------------------------------------------
>
>                 Key: NUTCH-1289
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1289
>             Project: Nutch
>          Issue Type: Bug
>          Components: fetcher
>    Affects Versions: nutchgora
>            Reporter: Dan Rosher
>             Fix For: nutchgora
>
>         Attachments: NUTCH-1289-v2.patch, NUTCH-1289.patch
>
>
> In distributed mode URL's are not partitioned to a specific machine which means the politeness
policy is voided

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message