nutch-dev mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Nutch Wiki] Trivial Update of "RunNutchInEclipse" by LewisJohnMcgibbney
Date Thu, 10 Nov 2011 00:46:51 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "RunNutchInEclipse" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/RunNutchInEclipse?action=diff&rev1=26&rev2=27

  
  
  === Configure Nutch ===
-  * see the [[http://wiki.apache.org/nutch/NutchTutorial|Tutorial]] and follow all configuration
steps, ensure that you '''DO NOT''' undertake any crawling. The directory structure for Nutch-1.3
enables us to edit nutch-site.xml.template, nutch-default.xml and regex-urlfilter.txt.template
in our /conf directory, these properties will then be automatically built into our /runtime
build folder.
+  * see the [[http://wiki.apache.org/nutch/NutchTutorial|Tutorial]] and follow all configuration
steps, ensure that you '''DO NOT''' undertake any crawling. The directory structure for Nutch
trunk enables us to edit nutch-site.xml.template, nutch-default.xml and regex-urlfilter.txt.template
in our /conf directory, these properties will then be automatically built into our /runtime
build folder.
   * ensure that you change the property "plugin.folders" to "./src/plugin" on $NUTCH_HOME/conf/nutch-site.xml
-  * Once we have ensured that Nutch 1.3 is correctly configured we can progress to building
within Eclipse.
+  * Once we have ensured that Nutch trunk is correctly configured we can progress to building
within Eclipse.
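
For illustration, the "plugin.folders" change described above might look like the following in $NUTCH_HOME/conf/nutch-site.xml. This is only a minimal sketch: it assumes the file uses the usual Hadoop-style <configuration> wrapper and carries no other overrides, and the comment text is illustrative rather than taken from the wiki page.

{{{
<?xml version="1.0"?>
<configuration>
  <!-- Override the default plugin location with the relative path named above -->
  <property>
    <name>plugin.folders</name>
    <value>./src/plugin</value>
  </property>
</configuration>
}}}

Any other properties set while following the Tutorial would sit alongside this one as further <property> elements inside the same <configuration> element.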
  
  === Build Nutch ===
-  * We can now progress to building Nutch by simply dragging the build.xml file into the
Ant perspective and double clicking on the build file. If you configured the project correctly,
Eclipse will build Nutch for you into "tmp_build" and you should see something similar to
the following:
+  * We can now progress to building Nutch by simply dragging the build.xml file into the
Ant perspective and double clicking on the build file. If you configured the project correctly,
Eclipse will build Nutch for you into "bin" and you should see something similar to the following:
  {{{
-  [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/deploy
-      [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/deploy/bin
-      [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/local/lib
-      [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/local/lib/native
-      [copy] Copying 20 files to /home/lewis/apache-nutch-1.3/runtime/local/conf
-      [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/local/bin
-      [copy] Copying 42 files to /home/lewis/apache-nutch-1.3/runtime/local/lib
-      [copy] Copying 107 files to /home/lewis/apache-nutch-1.3/runtime/local/plugins
  BUILD SUCCESSFUL
  Total time: 33 seconds
  }}}
@@ -115, +107 @@

  {{{
  <property>
    <name>plugin.folders</name>
-   <value>/home/....../nutch-1.3/src/plugin</value>
+   <value>/home/....../trunk/src/plugin</value>
  }}}
  
  === No plugins loaded during unit tests in Eclipse ===
@@ -127, +119 @@

  
  === debugging Hadoop classes ===
  Sometimes (fairly often) it makes sense to also have the Hadoop classes available during
debugging. This should really be second nature, as Nutch relies heavily upon the underlying Hadoop
infrastructure. You can therefore check out (svn) the Hadoop sources into your Eclipse IDE
and debug both projects together. You can:
-   * Checkout the Hadoop version that should be used within Nutch 1.3
+   * Checkout the Hadoop version that should be used within Nutch trunk
    * configure a Hadoop project similar to the Nutch project within your Eclipse IDE
    * add the Hadoop project as a dependent project of Nutch project
    * you can now also set breakpoints within Hadoop classes, such as InputFormat implementations.
