nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Nutch Wiki] Trivial Update of "ErrorMessagesInNutch2" by LewisJohnMcgibbney
Date Thu, 04 Aug 2011 09:32:06 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "ErrorMessagesInNutch2" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/ErrorMessagesInNutch2?action=diff&rev1=4&rev2=5

  
  == Missing plugins whilst running Nutch 2.0 on Cloudera's CDH3 ==
  
- Cloudera's CDH3 is Cloudera's distribution including Apache Hadoop. More information can
be found [[https://ccp.cloudera.com/display/CDHDOC/CDH3+Quick+Start+Guide|here]]. This common
error results due to a bug in MAPREDUCE-967 which modifies the way MapReduce unpacks the job's
jar. The old way was to unpack the whole of it, now only classes/ and lib/ are unpacked. This
way Nutch is missing the plugins/ directory. A workaround is to force unpacking of the plugin/
directory. This can be done by adding the following properties to nutch-site.xml
+ Cloudera's CDH3 is Cloudera's distribution including Apache Hadoop. More information can
be found [[https://ccp.cloudera.com/display/CDHDOC/CDH3+Quick+Start+Guide|here]]. This common
error results due to a bug in MAPREDUCE-967 which modifies the way MapReduce unpacks the job's
jar. The old way was to unpack the whole of it, now only classes/ and lib/ are unpacked. This
way Nutch is missing the plugins/ directory. A workaround is to force unpacking of the plugin/
directory. If you install only the CDH3 distro, you are OK. It is when you add the Hue distros
or try to use a Hadoop installed with the Cloudera SCM products that you run into problems.
This can be done by adding the following properties to nutch-site.xml
  {{{
  <property>
  <name>mapreduce.job.jar.unpack.pattern</name>

Mime
View raw message