[jira] [Commented] (NUTCH-1541) Indexer plugin to write CSV

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-1541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16533602#comment-16533602 ]

ASF GitHub Bot commented on NUTCH-1541:

sebastian-nagel commented on issue #294: NUTCH-1541 Indexer plugin to write CSV
URL: https://github.com/apache/nutch/pull/294#issuecomment-402708320
   Hi @r0ann3l, I'd be happy if you could take over and make this plugin work with the new indexer plugin configuration. Feel free to clean up the code. Thanks! It could be a useful plugin for debugging or when the data is used for data mining. Right now, it will only work in local mode. In distributed mode with big data, you would probably use a more efficient format anyway.

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[hidden email]

> Indexer plugin to write CSV
> ---------------------------
>                 Key: NUTCH-1541
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1541
>             Project: Nutch
>          Issue Type: New Feature
>          Components: indexer
>    Affects Versions: 1.7
>            Reporter: Sebastian Nagel
>            Priority: Minor
>             Fix For: 1.15
>         Attachments: NUTCH-1541-v1.patch, NUTCH-1541-v2.patch
>
> With the new pluggable indexer, a simple plugin would be handy to write configurable fields into a CSV file - for further analysis or just for export.

This message was sent by Atlassian JIRA