We have discussed it but not implemented it. A prerequisite to
implementing interfaces that use HBase for the current Nutch databases
was to make the Nutch architecture itself more flexible. This is what I
have been calling Nutch 2, and it is what I am currently working on.
Dennis
Marcus Herou wrote:
> Hi.
>
> Anyone tried to implement HBase as storage for:
>
> * CrawlDB
> * LinkDB
> * Fetched and parsed url data
>
> It would certainly be cool I think to be able to search in all these three
> db's. Currently it is a little bit hard to use the data crawled without
> actually indexing it.
>
> Kindly
>
> //Marcus
>
>
>