[jira] Created: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

classic Classic list List threaded Threaded
6 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

JIRA jira@apache.org
add HyphenationCompoundWordTokenFilterFactory class
---------------------------------------------------

                 Key: SOLR-1984
                 URL: https://issues.apache.org/jira/browse/SOLR-1984
             Project: Solr
          Issue Type: New Feature
            Reporter: P B
            Priority: Critical
         Attachments: HyphenationCompoundWordTokenFilterFactory.java

Please can you include my contribution into Solr night builds.

I can not compile on Linux server, I have tested only on Windows.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

P B updated SOLR-1984:
----------------------

    Attachment: HyphenationCompoundWordTokenFilterFactory.java

source code

> add HyphenationCompoundWordTokenFilterFactory class
> ---------------------------------------------------
>
>                 Key: SOLR-1984
>                 URL: https://issues.apache.org/jira/browse/SOLR-1984
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: P B
>            Priority: Critical
>         Attachments: HyphenationCompoundWordTokenFilterFactory.java
>
>
> Please can you include my contribution into Solr night builds.
> I can not compile on Linux server, I have tested only on Windows.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Uwe Schindler updated SOLR-1984:
--------------------------------

    Fix Version/s: 3.1
                   4.0
         Priority: Minor  (was: Critical)
      Component/s: Schema and Analysis

> add HyphenationCompoundWordTokenFilterFactory class
> ---------------------------------------------------
>
>                 Key: SOLR-1984
>                 URL: https://issues.apache.org/jira/browse/SOLR-1984
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: P B
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: HyphenationCompoundWordTokenFilterFactory.java
>
>
> Please can you include my contribution into Solr night builds.
> I can not compile on Linux server, I have tested only on Windows.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Assigned: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir reassigned SOLR-1984:
---------------------------------

    Assignee: Robert Muir

> add HyphenationCompoundWordTokenFilterFactory class
> ---------------------------------------------------
>
>                 Key: SOLR-1984
>                 URL: https://issues.apache.org/jira/browse/SOLR-1984
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: P B
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: HyphenationCompoundWordTokenFilterFactory.java
>
>
> Please can you include my contribution into Solr night builds.
> I can not compile on Linux server, I have tested only on Windows.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir updated SOLR-1984:
------------------------------

    Attachment: SOLR-1984.patch

Thank you very much for contributing this, its true there is no factory for this feature.

I updated your code with a few tweaks:
* allow null dictionary. This allows the use of just the hyphenation grammar (LUCENE-1287)
* allow encoding to be specified (but default to UTF-8). Some of the grammar distributions from offo dont use UTF-8 encoding.
* set onlyLongestMatch default to 'false'. this is just to be consistent with the TokenFilter itself, which defaults to false.
* added the Apache-licensed danish grammar to test-files, along with a small dictionary and some test cases.

if no one objects, i'll commit in a bit.


> add HyphenationCompoundWordTokenFilterFactory class
> ---------------------------------------------------
>
>                 Key: SOLR-1984
>                 URL: https://issues.apache.org/jira/browse/SOLR-1984
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: P B
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: HyphenationCompoundWordTokenFilterFactory.java, SOLR-1984.patch
>
>
> Please can you include my contribution into Solr night builds.
> I can not compile on Linux server, I have tested only on Windows.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Resolved: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir resolved SOLR-1984.
-------------------------------

    Resolution: Fixed

Committed revision 962555, 962559 (3x)

> add HyphenationCompoundWordTokenFilterFactory class
> ---------------------------------------------------
>
>                 Key: SOLR-1984
>                 URL: https://issues.apache.org/jira/browse/SOLR-1984
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: P B
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: HyphenationCompoundWordTokenFilterFactory.java, SOLR-1984.patch
>
>
> Please can you include my contribution into Solr night builds.
> I can not compile on Linux server, I have tested only on Windows.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]