[jira] [Commented] (TIKA-3098) Detecting embedded image

Sebastian Nagel (Jira)

    [ https://issues.apache.org/jira/browse/TIKA-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17101588#comment-17101588 ]

Tim Allison commented on TIKA-3098:

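The file contains a thumbnail image under docProps, and that is the embedded image Tika is reporting.  If you use tika-server's /rmeta endpoint or the RecursiveParserWrapper, you'll see this in the metadata for the embedded file: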

"embeddedRelationshipId": "/docProps/thumbnail.jpeg"
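
To confirm the thumbnail independently of Tika, one option is to list the entries of the OOXML package directly, since a .potx file is a ZIP archive. The following is only a minimal sketch using java.util.zip; the class name DocPropsLister and the method listDocProps are made up for illustration and are not part of Tika.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Enumeration;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;

// Hypothetical helper (not a Tika class): lists what is stored under docProps/.
class DocPropsLister {

    // Returns the names of all non-directory ZIP entries under docProps/.
    static List<String> listDocProps(Path ooxmlFile) throws IOException {
        List<String> names = new ArrayList<>();
        try (ZipFile zip = new ZipFile(ooxmlFile.toFile())) {
            Enumeration<? extends ZipEntry> entries = zip.entries();
            while (entries.hasMoreElements()) {
                ZipEntry entry = entries.nextElement();
                if (!entry.isDirectory() && entry.getName().startsWith("docProps/")) {
                    names.add(entry.getName());
                }
            }
        }
        return names;
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 1) {
            System.err.println("usage: java DocPropsLister <file.potx>");
            return;
        }
        // args[0] is the path to an OOXML file, e.g. the attachment on this issue.
        for (String name : listDocProps(Paths.get(args[0]))) {
            System.out.println(name);
        }
    }
}
```

If the attached file is passed as the argument, an entry named docProps/thumbnail.jpeg in the output would match the embeddedRelationshipId shown above, even though no image is visible on the slides themselves.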

We _should_ update the OOXML parser code to tag this image as type INLINE: https://tika.apache.org/1.24/api/org/apache/tika/metadata/TikaCoreProperties.EmbeddedResourceType.html#INLINE

> Detecting embedded image
> ------------------------
>                 Key: TIKA-3098
>                 URL: https://issues.apache.org/jira/browse/TIKA-3098
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.24
>            Reporter: suchendra
>            Priority: Minor
>         Attachments: test copy.potx
> I am trying to detect embedded images using Apache Tika. I have some simple Java code that uses EmbeddedDocumentExtractor to detect them.
> As far as I can see, the file contains no image, but Tika detects an embedded image.
> I have attached the file for reference.

This message was sent by Atlassian Jira