nutch-dev mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Nutch Wiki] Trivial Update of "Incremental Crawling Scripts Test" by Gabriele Kahlout
Date Sun, 27 Mar 2011 13:47:11 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "Incremental Crawling Scripts Test" page has been changed by Gabriele Kahlout.
http://wiki.apache.org/nutch/Incremental%20Crawling%20Scripts%20Test?action=diff&rev1=1&rev2=2

--------------------------------------------------

  rm: crawl/new_indexes: No such file or directory
  bin/nutch index crawl/new_indexes crawl/crawldb crawl/linkdb crawl/segments/20110327152839
  Indexer: starting at 2011-03-27 15:29:39
- content:4.0 while state.getLength():4 norm:0.25
- host:1.0 while state.getLength():1 norm:1.0
- site:1.0 while state.getLength():1 norm:1.0
- title:1.0 while state.getLength():0 norm:1.0
- url:7.0 while state.getLength():7 norm:0.14285715
- content:4.0 while state.getLength():4 norm:0.25
- host:1.0 while state.getLength():1 norm:1.0
- site:1.0 while state.getLength():1 norm:1.0
- title:1.0 while state.getLength():0 norm:1.0
- url:7.0 while state.getLength():7 norm:0.14285715
  Indexer: finished at 2011-03-27 15:29:57, elapsed: 00:00:18
  
  bin/nutch merge crawl/temp_indexes/part-1 crawl/indexes crawl/new_indexes
