[jira] [Commented] (LUCENE-8498) Deprecate/Remove LowerCaseTokenizer

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-8498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16614980#comment-16614980 ]

Adrien Grand commented on LUCENE-8498:
--------------------------------------

+1

> Deprecate/Remove LowerCaseTokenizer
> -----------------------------------
>
>                 Key: LUCENE-8498
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8498
>             Project: Lucene - Core
>          Issue Type: New Feature
>            Reporter: Alan Woodward
>            Priority: Major
>
> LowerCaseTokenizer combines tokenization and filtering in a way that prevents us improving the normalization API.  We should deprecate and remove it, as it can be replaced simply with a LetterTokenizer and LowerCaseFilter.
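
The replacement described above is a two-stage pipeline: first split the text on non-letters, then lowercase each token. The following is a minimal sketch of that idea in plain Java. It does not use Lucene's API; the class and method names (TokenizeSketch, letterTokenize, lowerCaseFilter, analyze) are hypothetical and only model the behaviour of the two components, on the assumption that a letter tokenizer emits maximal runs of letters.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only: models "tokenize, then filter" as two
// separate steps. None of these names belong to Lucene.
public class TokenizeSketch {

    // Stage 1 (stands in for a letter tokenizer): emit maximal runs of letters.
    public static List<String> letterTokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (Character.isLetter(cp)) {
                current.appendCodePoint(cp);
            } else if (current.length() > 0) {
                // A non-letter ends the current token.
                tokens.add(current.toString());
                current.setLength(0);
            }
            i += Character.charCount(cp);
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Stage 2 (stands in for a lowercase filter): normalize each token.
    public static List<String> lowerCaseFilter(List<String> tokens) {
        List<String> out = new ArrayList<>(tokens.size());
        for (String token : tokens) {
            out.add(token.toLowerCase(Locale.ROOT));
        }
        return out;
    }

    // Composition of the two stages, kept separate so that the
    // normalization step can be changed without touching tokenization.
    public static List<String> analyze(String text) {
        return lowerCaseFilter(letterTokenize(text));
    }

    public static void main(String[] args) {
        System.out.println(analyze("Deprecate/Remove LowerCaseTokenizer in 2018"));
    }
}
```

Keeping the stages separate is the point of the proposal: the lowercasing step can be swapped or reused on its own, which a combined tokenizer does not allow.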


