[jira] [Updated] (NUTCH-2249) WordNet Integration for Cosine Similarity

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view

[jira] [Updated] (NUTCH-2249) WordNet Integration for Cosine Similarity

JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sebastian Nagel updated NUTCH-2249:
    Fix Version/s:     (was: 1.15)

> WordNet Integration for Cosine Similarity
> -----------------------------------------
>                 Key: NUTCH-2249
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2249
>             Project: Nutch
>          Issue Type: New Feature
>          Components: plugin, scoring
>            Reporter: Bhavya Sanghavi
>            Assignee: Sujen Shah
>            Priority: Minor
>              Labels: memex
> Integrated WordNet database to enhance the cosine similarity plugin.
> This helps in reducing the size of the vectors for calculating the cosine similarity by mapping the synonymous words to the same entry in the vector. Consequently, it would increase the accuracy of the scores given to the webpages to be crawled.

This message was sent by Atlassian JIRA