nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Fredrik Andersson <fidde.anders...@gmail.com>
Subject Re: Nutch API
Date Mon, 12 Sep 2005 17:29:45 GMT
Hello Daniele!

>can nutch's functions (crawling,indexing,etc) be called from an external
> program written in java

Yes. Look at the bin/nutch script and you will see the entry points in the 
Java classes.

> If so, where can I find informations on APIs?

http://lucene.apache.org/java/docs/api/index.html
http://lucene.apache.org/nutch/apidocs/index.html

> Ho many pages can nutch actually manage? Is there a limit?

Nope. The filesystem sets the limit, and it's totally pluggable.

Greets,
Fredrik


On 9/12/05, Daniele Menozzi <menoz@ngi.it> wrote:
> 
> Hi all,I'm interested in nutch project, and it seems pretty good,but there
> are a few things I've not understood:
> 
> - can nutch's functions (crawling,indexing,etc) be called from an external
> program written in java,or I have always to use a shell script?
> If so, where can I find informations on APIs?
> - Ho many pages can nutch actually manage? Is there a limit?
> - nutch === lucene + crawling ?
> - why you do not use a database like mysql to store the data?
> 
> Thank you so much :))
> Menoz
> 
> --
> Free Software Enthusiast
> Debian Powered Linux User #332564
> http://menoz.homelinux.org
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message