[jira] [Commented] (TIKA-2479) Handle empty cells in tables uniformly

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-2479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16477426#comment-16477426 ]

Tim Allison commented on TIKA-2479:
-----------------------------------

+1 to [~gagravarr]'s proposal.  xls, xlsx and xlsb :)

> Handle empty cells in tables uniformly
> --------------------------------------
>
>                 Key: TIKA-2479
>                 URL: https://issues.apache.org/jira/browse/TIKA-2479
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Tim Allison
>            Priority: Minor
>         Attachments: patch.diff
>
>
> It looks like we output a <td/> for empty cells in xls, and tables in doc, docx and pptx.  However, we don't retain empty cells in xlsx or tables in ppt.  We should make this handling uniform.
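
For illustration only, here is a minimal sketch of what "uniform" handling could look like: every cell in a row is emitted, and an empty or null cell becomes a self-closing <td/> rather than being dropped. The class UniformEmptyCells and the method renderRow are hypothetical names invented for this example. They are not part of Tika's API and are not the attached patch.diff.

```java
import java.util.Arrays;
import java.util.List;

public class UniformEmptyCells {

    // Hypothetical helper (not Tika code): renders one table row and
    // keeps empty cells as <td/> so column positions are preserved.
    // XML escaping of cell text is deliberately omitted to keep the sketch short.
    static String renderRow(List<String> cells) {
        StringBuilder sb = new StringBuilder("<tr>");
        for (String cell : cells) {
            if (cell == null || cell.isEmpty()) {
                // Emit a placeholder instead of skipping the cell.
                sb.append("<td/>");
            } else {
                sb.append("<td>").append(cell).append("</td>");
            }
        }
        return sb.append("</tr>").toString();
    }

    public static void main(String[] args) {
        // A row whose second and third cells are empty.
        System.out.println(renderRow(Arrays.asList("a", "", null, "d")));
    }
}
```

With the empty cells retained, the "d" above stays in the fourth column regardless of which parser produced the row, which is the property the issue asks for across formats.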



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)