nutch-dev mailing list archives

From Apache Wiki <>
Subject [Nutch Wiki] Update of "FAQ" by JulienNioche
Date Mon, 09 Jul 2012 15:35:44 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "FAQ" page has been changed by JulienNioche:

  ==== I have two XML files, nutch-default.xml and nutch-site.xml, why? ====
  nutch-default.xml is the out-of-the-box configuration for Nutch, and most settings
can (and should, unless you know what you're doing) stay as they are. nutch-site.xml is where you
make the changes that override the default settings.
+ === Compiling Nutch ===
+ ==== How do I compile Nutch? ====
+ Install ANT and run 'ant' on the command line from the directory containing the Nutch source
code. Note: this won't work with the binary release, only with the source release.
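
The compile step above can be sketched as a short shell snippet. NUTCH_SRC and its default path are hypothetical placeholders for wherever the Nutch source was unpacked, and the check for build.xml assumes the ANT build file sits at the top of the source tree.

```shell
# NUTCH_SRC is a hypothetical location of an unpacked Nutch source tree;
# override it in the environment to point at your own checkout.
NUTCH_SRC="${NUTCH_SRC:-$HOME/src/nutch}"

# Only attempt the build when ant is installed and a build file is present
# (assumption: build.xml lives in the root of the source tree).
if command -v ant >/dev/null 2>&1 && [ -f "$NUTCH_SRC/build.xml" ]; then
    # Run ant from the source directory, in a subshell so the
    # caller's working directory is left unchanged.
    (cd "$NUTCH_SRC" && ant)
else
    echo "ant or $NUTCH_SRC/build.xml not found; nothing built"
fi
```

Running it against a binary release should fall through to the message branch, since that case is not expected to build.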
+ ==== How do I compile Nutch in Eclipse? ====
+ Nutch uses ANT+IVY to compile the code and manage the dependencies (see above). There are
instructions on how to get Nutch working with Eclipse on []
but the easiest way of doing it is to use ANT for compiling and rely on Eclipse just for visualising
the code. You can also debug with Eclipse using remote debugging, e.g. by setting 'export
NUTCH_OPTS="-Xdebug -agentlib:jdwp=transport=dt_socket,server=y,address=8000"' prior to calling
the nutch script in /runtime/local/bin.
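
As a rough sketch of the remote-debugging setup described above: the options need to be quoted so that the shell passes both JVM flags in a single NUTCH_OPTS value. DEBUG_PORT is a hypothetical helper variable introduced here for illustration; 8000 is the port from the text.

```shell
# Port the JVM will listen on for a debugger (8000 as in the text above).
# DEBUG_PORT is a helper variable for this sketch, not something Nutch reads.
DEBUG_PORT=8000

# Quote the whole value: without quotes the shell would treat the second
# flag as a separate argument to export instead of part of NUTCH_OPTS.
export NUTCH_OPTS="-Xdebug -agentlib:jdwp=transport=dt_socket,server=y,address=${DEBUG_PORT}"

# Show what the nutch script will pick up from the environment.
echo "NUTCH_OPTS=$NUTCH_OPTS"

# Next, call the nutch script from runtime/local/bin in the same shell,
# then attach Eclipse as a remote Java application on the port above.
```

With the JDWP agent's default settings the JVM should wait for a debugger to attach before running, so the nutch command will appear to hang until Eclipse connects.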
  === Injecting ===
  ==== What happens if I inject urls several times? ====
