[jira] [Commented] (TIKA-3089) Text should be wrapped in pre-tags instead of in p-tags

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (TIKA-3089) Text should be wrapped in pre-tags instead of in p-tags

Clark Perkins (Jira)

    [ https://issues.apache.org/jira/browse/TIKA-3089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17097247#comment-17097247 ]

Peter van der Weerd commented on TIKA-3089:
-------------------------------------------

I'm hesitating: coping with backward compatibility could also be implemented by having the PRE tags within the P, instead of replacing them.
That keeps the text/csv parsers configless.

Another option, as suggested by Nick is to create an OutputConfig with currently 1 member: wrapTextInPreTag. Or a member like 'wsyiwyg', which is more general and could be used by more parsers (pdf for instance)

Votes?

 

> Text should be wrapped in pre-tags instead of in p-tags
> -------------------------------------------------------
>
>                 Key: TIKA-3089
>                 URL: https://issues.apache.org/jira/browse/TIKA-3089
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.24
>            Reporter: Peter van der Weerd
>            Priority: Minor
>
> Currently text is treated as normal html, which causes a drama in the possible layout. Like, line-endings are not honored, font is not fixed, etc.
>  
> By wrapping in pre-tags, the layout will be much better.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)