nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Earl Cahill <cahi...@yahoo.com>
Subject Re: bug in bin/nutch?
Date Fri, 09 Sep 2005 09:23:45 GMT
> The DB format in mapred branch is completely
> different. So, what you 
> create with "admin db -create" is the old DB format,
> not used in the 
> mapred branch.
> 
> Please study the code to the Crawl command, this
> should help... Mapred 
> stuff is powerful, but it is also very different
> from the current way of 
> doing things, so there will be alot to learn...

Guess I figured as much.  Can I suggest that someone
typing 

bin/nutch admin ...

in the mappred branch, should get pointed to the
proper command, or at least a message saying that
admin doesn't exist in the mapred branch, just to save
some confusion.  There is a dumb patch below that
would change the usage line.

I think such differences are all the more reason to
have a nice mapred tutorial, which I would be more
than willing to help with.  I thought I was close, but
I have yet to get a mapred crawl/index/search
completed.  Your comment makes me think I am still
aways off.

Thanks,
Earl

Index: bin/nutch
===================================================================
--- bin/nutch   (revision 279734)
+++ bin/nutch   (working copy)
@@ -29,7 +29,7 @@
   echo "Usage: nutch COMMAND"
   echo "where COMMAND is one of:"
   echo "  crawl             one-step crawler for
intranets"
-  echo "  admin             database administration,
including creation"
+  echo "  admin             not used in mapred"
   echo "  inject            inject new urls into the
database"
   echo "  generate          generate new segments to
fetch"
   echo "  fetch             fetch a segment's pages"


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 

Mime
View raw message