nutch-dev mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Nutch Wiki] Trivial Update of "NutchScoring" by LewisJohnMcgibbney
Date Sat, 20 Sep 2014 16:21:12 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "NutchScoring" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/NutchScoring?action=diff&rev1=4&rev2=5

  
  
  == What Scoring is... what it means in Nutch ==
+  * Describe CrawlDatum data structure in Nutch trunk
+ A scoring filter manipulates the scoring variables in CrawlDatum and in the resulting search indexes. Filters can be chained in a specific order to provide multi-stage scoring adjustments.
  
  == Where Scoring takes place within the Nutch Crawl cycle ==
  
  == Scoring extension points ==
  
+  * ScoringFilter - A scoring filter manipulates the scoring variables in CrawlDatum and in the resulting search indexes. Filters can be chained in a specific order to provide multi-stage scoring adjustments.
+  * ScoringFilters - Creates and caches the plugins that implement ScoringFilter.
+ 
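The chaining described above can be pictured with a small, self-contained Java sketch. It is only an illustration under stated assumptions: `ScoreStage`, `TldBoostStage`, `DepthPenaltyStage` and `runChain` are hypothetical names invented here, and they operate on a bare float rather than on a CrawlDatum. The actual Nutch ScoringFilter extension point is richer than this and should be checked against the Nutch Javadoc.

```java
import java.net.URI;
import java.util.Arrays;
import java.util.List;

// Sketch only: every type below is hypothetical and is NOT part of the Nutch API.
class ScoringChainSketch {

    // Hypothetical single-method stage standing in for one scoring filter.
    interface ScoreStage {
        float apply(URI url, float score);
    }

    // Illustrative stage: doubles the score when the host ends in ".org".
    static final class TldBoostStage implements ScoreStage {
        @Override
        public float apply(URI url, float score) {
            String host = url.getHost();
            return (host != null && host.endsWith(".org")) ? score * 2.0f : score;
        }
    }

    // Illustrative stage: subtracts 0.25 for every non-empty path segment.
    static final class DepthPenaltyStage implements ScoreStage {
        @Override
        public float apply(URI url, float score) {
            String path = url.getPath();
            int depth = 0;
            if (path != null) {
                for (String segment : path.split("/")) {
                    if (!segment.isEmpty()) {
                        depth++;
                    }
                }
            }
            return score - 0.25f * depth;
        }
    }

    // Runs the stages in list order, feeding each result into the next stage.
    static float runChain(List<ScoreStage> stages, URI url, float initialScore) {
        float score = initialScore;
        for (ScoreStage stage : stages) {
            score = stage.apply(url, score);
        }
        return score;
    }

    public static void main(String[] args) {
        URI url = URI.create("http://nutch.apache.org/a/b");
        List<ScoreStage> boostFirst =
            Arrays.asList(new TldBoostStage(), new DepthPenaltyStage());
        List<ScoreStage> penaltyFirst =
            Arrays.asList(new DepthPenaltyStage(), new TldBoostStage());

        // Same stages, different order, different final score.
        System.out.println(runChain(boostFirst, url, 1.0f));   // prints 1.5
        System.out.println(runChain(penaltyFirst, url, 1.0f)); // prints 1.0
    }
}
```

The two runs use the same stages and differ only in their order, which is why the order of a filter chain has to be chosen deliberately.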
  == Examples ==
   * NewScoring -- New, stable, PageRank-like webgraph and link-analysis jobs.
   * NewScoringIndexingExample -- Two full fetch cycles of commands using the new scoring and indexing systems.
+  * AbstractScoringFilter
+  * DepthScoringFilter
+  * LinkAnalysisScoringFilter
+  * OPICScoringFilter
+  * TLDScoringFilter
+  * URLMetaScoringFilter
  
  == Development Issues ==
  FixingOpicScoring - ''In planning''.
