[jira] Created: (TIKA-208) Special characters in HTML file are not parsed correctly


[jira] Created: (TIKA-208) Special characters in HTML file are not parsed correctly

JIRA jira@apache.org
Special characters in HTML file are not parsed correctly
---------------------------------------------------------

                 Key: TIKA-208
                 URL: https://issues.apache.org/jira/browse/TIKA-208
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 0.3
            Reporter: Siddharth Gargate


Words containing the characters ä or ö are not parsed correctly when they appear in an HTML document.
Please refer to the discussion:
http://markmail.org/message/jgwzbw63o67amqu3
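
The report does not state the root cause. One common way for ä and ö to come out wrong is a charset mismatch, where UTF-8 bytes are decoded as ISO-8859-1. The sketch below only illustrates that general failure mode; the sample text and the helper name `decode_html_bytes` are hypothetical and are not taken from Tika or from the fix for this issue.

```python
# Hypothetical illustration of a charset mismatch; not Tika code and not
# necessarily the cause of TIKA-208.

def decode_html_bytes(raw: bytes, charset: str) -> str:
    """Decode raw document bytes using the given charset name."""
    return raw.decode(charset)

# Sample text containing the characters named in the report.
original = "hätte schön"

# The document is stored on disk as UTF-8 bytes.
raw = original.encode("utf-8")

# A parser that wrongly assumes ISO-8859-1 turns each two-byte UTF-8
# sequence into two separate characters.
garbled = decode_html_bytes(raw, "iso-8859-1")

# Decoding with the charset the document actually uses restores the text.
restored = decode_html_bytes(raw, "utf-8")

print(garbled)
print(restored)
```

If the symptom in a given document matches the garbled form printed here, checking the charset declared in the HTML `<meta>` tag or the HTTP `Content-Type` header against the real byte encoding is a reasonable first step.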

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (TIKA-208) Special characters in HTML file are not parsed correctly

JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/TIKA-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-208.
--------------------------------

       Resolution: Fixed
    Fix Version/s: 0.4
         Assignee: Jukka Zitting

Fixed in revision 757751.



> Special characters in HTML file are not parsed correctly
> ---------------------------------------------------------
>
>                 Key: TIKA-208
>                 URL: https://issues.apache.org/jira/browse/TIKA-208
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.3
>            Reporter: Siddharth Gargate
>            Assignee: Jukka Zitting
>             Fix For: 0.4
>
>
> Words containing the characters ä or ö are not parsed correctly when they appear in an HTML document.
> Please refer to the discussion:
> http://markmail.org/message/jgwzbw63o67amqu3
