[jira] [Commented] (TIKA-2264) Better handling of footnotes/endnotes for ODF files

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15866270#comment-15866270 ]

Tim Allison commented on TIKA-2264:

bq. Please bear in mind that I'm also something of a newb

You should have seen my early contributions...you'd feel much better.  Actually, take a look at my recent ones, too. :)

> Better handling of footnotes/endnotes for ODF files
> ---------------------------------------------------
>                 Key: TIKA-2264
>                 URL: https://issues.apache.org/jira/browse/TIKA-2264
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.14
>         Environment: N/A
>            Reporter: Mike Rodent
>            Priority: Minor
>              Labels: newbie
>         Attachments: ImprovedODFContentParser.java, _ImprovedODFContentParserUTest.java, test.odt
> Springs from my question here (http://stackoverflow.com/questions/42031237/modify-apache-tika-parsing-of-old-1997-2003-ms-word-docs) ... I have improved the class OpenDocumentContentParser so that it puts footnotes/endnotes at the end of the line to which they belong and doesn't break up the line in question.  As with .docx parsing, the notes can be linked to the reference easily.  The respondent on Stack Overflow suggested I open an issue here...

This message was sent by Atlassian JIRA