nutch-dev mailing list archives

From "Lewis John McGibbney (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2373) Indexer for Hbase
Date Sun, 16 Apr 2017 18:00:43 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15970464#comment-15970464 ]

Lewis John McGibbney commented on NUTCH-2373:
---------------------------------------------

As I said before... there is already functionality to index the WebPage (0) and Host (1) objects
using GORA. If you've used HBase to implement an IndexWriter then submit your pull request
and we can review. Thanks.

(0) https://github.com/apache/nutch/blob/2.x/src/gora/webpage.avsc
(1) https://github.com/apache/nutch/blob/2.x/src/gora/host.avsc

> Indexer for Hbase
> -----------------
>
>                 Key: NUTCH-2373
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2373
>             Project: Nutch
>          Issue Type: New Feature
>          Components: indexer
>    Affects Versions: 2.3
>            Reporter: Kaidul Islam
>            Assignee: Kaidul Islam
>            Priority: Minor
>             Fix For: 2.4
>
>
> Some use cases involve storing the documents in a database rather than in an indexing
> search engine such as Solr or Elasticsearch. This is a plugin to send the documents to
> HBase storage.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
