[jira] [Created] (SOLR-3494) The hunspell filter should support compressed Hunspell dictionaries

classic Classic list List threaded Threaded
25 messages Options
12
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (SOLR-3494) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
ludovic Boutros created SOLR-3494:
-------------------------------------

             Summary: The hunspell filter should support compressed Hunspell dictionaries
                 Key: SOLR-3494
                 URL: https://issues.apache.org/jira/browse/SOLR-3494
             Project: Solr
          Issue Type: Improvement
          Components: Schema and Analysis
    Affects Versions: 3.5, 3.6
            Reporter: ludovic Boutros


OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (SOLR-3494) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-3494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated SOLR-3494:
----------------------------------

    Attachment: SOLR-3494.patch

I manage to load french compressed dictionaries with this little patch.
               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: SOLR-3494
>                 URL: https://issues.apache.org/jira/browse/SOLR-3494
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>    Affects Versions: 3.5, 3.6
>            Reporter: ludovic Boutros
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (SOLR-3494) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-3494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated SOLR-3494:
----------------------------------

    Attachment:     (was: SOLR-3494.patch)
   

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: SOLR-3494
>                 URL: https://issues.apache.org/jira/browse/SOLR-3494
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>    Affects Versions: 3.5, 3.6
>            Reporter: ludovic Boutros
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (SOLR-3494) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-3494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated SOLR-3494:
----------------------------------

    Attachment: SOLR-3494.patch

little changes.
               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: SOLR-3494
>                 URL: https://issues.apache.org/jira/browse/SOLR-3494
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>    Affects Versions: 3.5, 3.6
>            Reporter: ludovic Boutros
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (SOLR-3494) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-3494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Simon Willnauer updated SOLR-3494:
----------------------------------

    Affects Version/s: 4.0
        Fix Version/s: 3.6.1
                       4.0
   

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: SOLR-3494
>                 URL: https://issues.apache.org/jira/browse/SOLR-3494
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Moved] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Simon Willnauer moved SOLR-3494 to LUCENE-4079:
-----------------------------------------------

          Component/s:     (was: Schema and Analysis)
                       modules/analysis
        Fix Version/s:     (was: 3.6.1)
                           (was: 4.0)
                       3.6.1
                       4.0
        Lucene Fields: New,Patch Available
    Affects Version/s:     (was: 3.6)
                           (was: 3.5)
                           (was: 4.0)
                       4.0
                       3.5
                       3.6
                  Key: LUCENE-4079  (was: SOLR-3494)
              Project: Lucene - Java  (was: Solr)
   

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.6, 3.5, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284378#comment-13284378 ]

Simon Willnauer commented on LUCENE-4079:
-----------------------------------------

Ludovic, I moved this to lucene since its a lucene issue.
               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284383#comment-13284383 ]

Chris Male commented on LUCENE-4079:
------------------------------------

Thanks Simon
               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284385#comment-13284385 ]

ludovic Boutros commented on LUCENE-4079:
-----------------------------------------

oups, yes you are right, thank you Simon.

               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated LUCENE-4079:
------------------------------------

    Attachment: LUCENE-4079.patch
   

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated LUCENE-4079:
------------------------------------

    Attachment:     (was: SOLR-3494.patch)
   

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284386#comment-13284386 ]

ludovic Boutros commented on LUCENE-4079:
-----------------------------------------

patch renamed.
               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284389#comment-13284389 ]

Chris Male commented on LUCENE-4079:
------------------------------------

Ludovic, do you think you could include some tests to verify this behavior?
               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Assigned] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris Male reassigned LUCENE-4079:
----------------------------------

    Assignee: Chris Male
   

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated LUCENE-4079:
------------------------------------

    Attachment: LUCENE-4079.patch

The patch with a test unit.

I don't know if a compressed dictionary could use both naming types (alias and direct rule name).
In the c++ code it seems that it is not possible so I did not test it in the java code.
               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated LUCENE-4079:
------------------------------------

    Attachment:     (was: LUCENE-4079.patch)
   

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284605#comment-13284605 ]

Chris Male commented on LUCENE-4079:
------------------------------------

Ludovic,

Sorry to keep asking for changes, but do you think you could make your patch against trunk?
               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284650#comment-13284650 ]

ludovic Boutros commented on LUCENE-4079:
-----------------------------------------

No problem, I will try to make the patch against trunk today.
               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated LUCENE-4079:
------------------------------------

    Attachment: LUCENE-4079-3.6.x.patch
                LUCENE-4079-trunk.patch

Ok, I have merged some typo corrections from the trunk to the branch 3.6.
I have applied the patch to the trunk, and run the tests.

Do you need something else ? :)
               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079-3.6.x.patch, LUCENE-4079-trunk.patch, LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284736#comment-13284736 ]

Chris Male commented on LUCENE-4079:
------------------------------------

Thanks Ludovic, I'll take a look.
               

> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079-3.6.x.patch, LUCENE-4079-trunk.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

12