[jira] [Commented] (NUTCH-2631) KafkaIndexWriter

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (NUTCH-2631) KafkaIndexWriter

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-2631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16572038#comment-16572038 ]

ASF GitHub Bot commented on NUTCH-2631:
---------------------------------------

AyalCiobotaru opened a new pull request #373: fix for NUTCH-2631 contributed by AyalCiobotaru
URL: https://github.com/apache/nutch/pull/373
 
 
   Update for the Indexer plugin in order to write to Kafka. Created a KafkaIndexWriter in order to produce the crawled documents into Kafka and have Kafka distribute the messages as necessary. Ref PR #372.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


> KafkaIndexWriter
> ----------------
>
>                 Key: NUTCH-2631
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2631
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: Ayal Ciobotaru
>            Priority: Minor
>              Labels: patch
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> There is no current way to index directly into Kafka in order to have a full message based system controlled by Kafka. Created a KafkaIndexWriter in order to produce the crawled documents into Kafka and have Kafka distribute the messages as necessary.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)