Encoding issue in solr

Encoding issue in solr

UMA MAHESWAR
Hi all,

I am using Nutch to crawl a site and index it into Solr, and I am running into an encoding issue with the data stored in Solr.

The page on the site has this title:

title : ebm-papst Motoren & Ventilatoren GmbH - Axialventilatoren und
Radialventilatoren aus Linz, Österreich

but Solr stores it in the form below, with the last two characters missing:

"title": "ebm-papst Motoren & Ventilatoren GmbH - Axialventilatoren und
Radialventilatoren aus Linz, Österrei",

Please suggest how I can store the actual data in Solr.

Thanks for your suggestions.




--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Re: Encoding issue in solr

Tim Allison
This is probably caused by an encoding detection problem in Nutch and/or
Tika. If you can share the file on the Tika user’s list, I can take a look.
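
Before sharing the file, a rough local check can help narrow things down. The sketch below is only an illustration in plain Python and does not use Nutch, Tika, or Solr: it compares the expected and stored titles from this thread, and shows what the title looks like when its UTF-8 bytes are decoded with a few candidate charsets. The function names `missing_suffix` and `decode_candidates`, and the list of candidate charsets, are assumptions made for this example.

```python
# Titles taken from the post; the shared prefix is factored out to avoid typos.
PREFIX = ("ebm-papst Motoren & Ventilatoren GmbH - Axialventilatoren und "
          "Radialventilatoren aus Linz, ")
EXPECTED = PREFIX + "Österreich"   # title on the site
STORED = PREFIX + "Österrei"       # title as stored in Solr


def missing_suffix(expected, stored):
    """Return the part of `expected` that is absent from `stored`
    if `stored` is a plain prefix of `expected`, otherwise None."""
    if expected.startswith(stored):
        return expected[len(stored):]
    return None


def decode_candidates(raw, charsets=("utf-8", "iso-8859-1", "windows-1252")):
    """Decode the same bytes under each candidate charset.

    The charset list is an assumption; use the ones relevant to the site.
    Undecodable bytes are replaced rather than raising an error.
    """
    return {cs: raw.decode(cs, errors="replace") for cs in charsets}


if __name__ == "__main__":
    # Is the stored value a truncated copy of the original?
    print("missing suffix:", repr(missing_suffix(EXPECTED, STORED)))

    raw = EXPECTED.encode("utf-8")
    # Character count and UTF-8 byte count differ because of the one
    # non-ASCII character in the title.
    print("characters:", len(EXPECTED), "utf-8 bytes:", len(raw))

    # Show the tail of the title under each decoding.
    for charset, text in decode_candidates(raw).items():
        print(f"{charset:>12}: {text[-12:]!r} matches={text == EXPECTED}")
```

With the two titles from the post, the stored value is the original with the final "ch" missing and the "Ö" intact. That looks more like truncation than garbled characters, though it does not by itself rule out encoding detection as the underlying cause.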

(In reply to UMA MAHESWAR's message of Fri, Oct 5, 2018, 7:11 AM, shown above.)