nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sebastian Nagel (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2334) Extension point for schedulers
Date Thu, 24 Nov 2016 15:55:58 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15693599#comment-15693599
] 

Sebastian Nagel commented on NUTCH-2334:
----------------------------------------

Hi [~roannel],
what does "extension point for schedulers" mean?

Which implementation of the interface o.a.n.crawl.FetchSchedule is used is already configurable
via
{code}
<property>
  <name>db.fetch.schedule.class</name>
  <value>org.apache.nutch.crawl.DefaultFetchSchedule</value>
  <description>The implementation of fetch schedule. DefaultFetchSchedule simply
  adds the original fetchInterval to the last fetch time, regardless of
  page changes, whereas AdaptiveFetchSchedule (see below) tries to adapt
  to the rate at which a given page is changed. 
  </description>
</property>
{code}
Also a customized fetch scheduler can be used this way (see also o.a.n.crawl.AbstractFetchSchedule).

> Extension point for schedulers
> ------------------------------
>
>                 Key: NUTCH-2334
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2334
>             Project: Nutch
>          Issue Type: New Feature
>          Components: generator
>    Affects Versions: 1.12
>            Reporter: Roannel Fernández Hernández
>            Priority: Minor
>             Fix For: 1.13
>
>
> With an extension point for schedulers, the users should be able to create new schedulers
that meet to their own needs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message