nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sebastian Nagel (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2333) Indexer for RabbitMQ
Date Wed, 18 Jan 2017 15:06:26 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15828234#comment-15828234
] 

Sebastian Nagel commented on NUTCH-2333:
----------------------------------------

+1 looks good, although I haven't tested it.

Yes, there is some overlap between indexer-rabbit and publish-rabbitmq, mostly regarding configuration
of and connection to RabbitMQ. Eventually, code could be shared in a lib-rabbitmq plugin,
now or as a later improvement.

To implement indexer-rabbit as a NutchPublisher seems difficult: the IndexWriter and NutchPublisher
interfaces are different, esp. how objects are serialized (a specific object "NutchDocument"
vs. an unknown but universally JSON serializable object). Of course, one could think of indexing
as a event, but in reality it's likely that different consumers/queues are used for monitoring
and indexing content.

> Indexer for RabbitMQ
> --------------------
>
>                 Key: NUTCH-2333
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2333
>             Project: Nutch
>          Issue Type: New Feature
>          Components: indexer
>    Affects Versions: 1.12
>            Reporter: Roannel Fernández Hernández
>            Priority: Minor
>             Fix For: 1.13
>
>
> A plugin to send the documents to a RabbitMQ server.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message