nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <>
Subject [jira] [Commented] (NUTCH-2631) KafkaIndexWriter
Date Fri, 03 Aug 2018 15:36:00 GMT


ASF GitHub Bot commented on NUTCH-2631:

sebastian-nagel commented on issue #372: NUTCH-2631 KafkaIndexWriter
   Hi @AyalCiobotaru, could you provide a PR for master? A PR against a release branch is
useless, simply because 1.14 has already been released and cannot be changed afterwards. Thanks!

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:

> KafkaIndexWriter
> ----------------
>                 Key: NUTCH-2631
>                 URL:
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: Ayal Ciobotaru
>            Priority: Minor
>              Labels: patch
>   Original Estimate: 168h
>  Remaining Estimate: 168h
> There is no current way to index directly into Kafka in order to have a full message
based system controlled by Kafka. Created a KafkaIndexWriter in order to produce the crawled
documents into Kafka and have Kafka distribute the messages as necessary.

This message was sent by Atlassian JIRA

View raw message