nutch-dev mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Nutch Wiki] Trivial Update of "bin/nutch_fetch" by LewisJohnMcgibbney
Date Fri, 01 Jul 2011 22:30:05 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "bin/nutch_fetch" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/bin/nutch_fetch?action=diff&rev1=5&rev2=6

Comment:
trivial update

  
  '''[-threads n]''': This argument sets the number of threads that work concurrently
on fetching URLs in the desired segment, i.e. the number of fetcher threads the fetcher should
use. This also determines the maximum number of requests that are made at once (each fetcher
thread handles one connection).
  
- '''[-noParsing]''': If no argument is passed, this value is the default. This is the default
because errors can occur when parsing segments. If errors occur, the results of the
whole fetching process can be corrupted. Note that parsing will only follow meta-redirects
coming from the original URL.
+ '''[-noParsing]''': If no argument is passed, this value is the default, as set in nutch-default.xml.
This is the default because errors can occur when parsing segments. If parsing errors occur,
the results of the whole fetching process can be corrupted. Note that parsing will only
follow meta-redirects coming from the original URL.
  
  CommandLineOptions
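
As a rough illustration of how the two options described above might be combined, here is a minimal sketch. The flag names are taken from the page text; the segment path and the thread count are hypothetical values, and the exact usage may differ between Nutch versions, so check the usage output of your own installation. The sketch only prints the command line rather than running it.

```shell
# Hypothetical values, not taken from the wiki page; adjust to your crawl.
SEGMENT="crawl/segments/20110701223005"   # assumed segment directory
THREADS=10                                # assumed number of fetcher threads

# Assemble the command: the segment first, then the optional flags
# described above ([-threads n] and [-noParsing]).
FETCH_CMD="bin/nutch fetch $SEGMENT -threads $THREADS -noParsing"

# Print the command instead of executing it, so the sketch is safe to try.
echo "$FETCH_CMD"
```

With ten threads, up to ten requests would be in flight at once, since each fetcher thread handles one connection.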
  
