[jira] [Resolved] (SOLR-7193) Concatenate words from token stream

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Resolved] (SOLR-7193) Concatenate words from token stream

JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-7193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

David Smiley resolved SOLR-7193.
--------------------------------
    Resolution: Duplicate

> Concatenate words from token stream
> -----------------------------------
>
>                 Key: SOLR-7193
>                 URL: https://issues.apache.org/jira/browse/SOLR-7193
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: Abhishek Bafna
>            Priority: Major
>         Attachments: concatenate_words.patch
>
>
> The user entered data often don't have proper spacing between words and words spelling and format also varies from data like business names, address etc. After tokenizing data, we might perform pattern replacement, stop word filtering etc. Later we want to concatenate all the tokens and generate n-grams token for indexing business name and perform the fuzzy match.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]