[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view

[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890848#action_12890848 ]

Stanislaw Osinski commented on SOLR-1804:

Essentially, the dependency on the smart chinese is optional in a sense that the lack of it will degrade the quality of clustering in Chinese, but will not break it. Let me see if I can make it optionally loadable in LuceneLanguageModelFactory too.

I think we could handle this in a similar way as in Carrot2: attempt to load chinese tokenizer and fall back to the default one in case of class loading exceptions. The easiest implementation route would be to include smart chinese as a dependency during compilation of the clustering plugin with an understanding that the library may or may not be available during runtime. Is that possible with the current Solr compilation scripts?

> Upgrade Carrot2 to 3.2.0
> ------------------------
>                 Key: SOLR-1804
>                 URL: https://issues.apache.org/jira/browse/SOLR-1804
>             Project: Solr
>          Issue Type: Improvement
>          Components: contrib - Clustering
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>         Attachments: SOLR-1804-carrot2-3.4.0-dev-libs.zip, SOLR-1804-carrot2-3.4.0-dev.patch
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]