nutch-dev mailing list archives

From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2631) KafkaIndexWriter
Date Thu, 02 Aug 2018 16:56:00 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16567066#comment-16567066 ]

ASF GitHub Bot commented on NUTCH-2631:
---------------------------------------

AyalCiobotaru opened a new pull request #372: NUTCH-2631 KafkaIndexWriter
URL: https://github.com/apache/nutch/pull/372
 
 
   Updates the indexer plugin so that it can write to Kafka. Adds a KafkaIndexWriter that
produces the crawled documents into Kafka and lets Kafka distribute the messages as
necessary.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> KafkaIndexWriter
> ----------------
>
>                 Key: NUTCH-2631
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2631
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: Ayal Ciobotaru
>            Priority: Minor
>              Labels: patch
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> There is currently no way to index directly into Kafka in order to have a fully
> message-based system controlled by Kafka. Created a KafkaIndexWriter in order to produce
> the crawled documents into Kafka and have Kafka distribute the messages as necessary.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
