[jira] [Resolved] (NUTCH-2479) urlmeta plugin port from 1.x to 2.x

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view

[jira] [Resolved] (NUTCH-2479) urlmeta plugin port from 1.x to 2.x

David Pilato (Jira)

     [ https://issues.apache.org/jira/browse/NUTCH-2479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sebastian Nagel resolved NUTCH-2479.
    Resolution: Auto Closed

Closing 2.5 issues as branch is no longer maintained.

> urlmeta plugin port from 1.x to 2.x
> -----------------------------------
>                 Key: NUTCH-2479
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2479
>             Project: Nutch
>          Issue Type: New Feature
>          Components: nutch server, plugin, REST_api
>    Affects Versions: 2.3.1
>            Reporter: Ninaad Joshi
>            Priority: Minor
>              Labels: patch, plugin
>             Fix For: 2.5
>         Attachments: Ninaad.Joshi.plugin.urlmeta.patch
> I have ported urlmeta plugin available in 1.x to 2.x
> It is designed to do two things:
> * Meta Tags that are supplied with your Crawl URLs, during injection either through seed.txt or through REST API, will be propagated throughout the out-links of those Crawl URLs
> * When you index your URLs, the meta tags that you specified with your URLs will be indexed alongside those URLs--and can be directly queried, assuming you have done everything else correctly.
> I have also added support through the NutchServer REST-API. Have Attached patch along with this issue.

This message was sent by Atlassian Jira