[jira] [Comment Edited] (TIKA-3098) Detecting embedded image

Clark Perkins (Jira)

    [ https://issues.apache.org/jira/browse/TIKA-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17101588#comment-17101588 ]

Tim Allison edited comment on TIKA-3098 at 5/7/20, 11:55 AM:
-------------------------------------------------------------

There's a thumbnail under docProps.  If you use /rmeta or the RecursiveParserWrapper, you'll see this:

"embeddedRelationshipId": "/docProps/thumbnail.jpeg"

We _should_ update the ooxml code to tag this image as type thumbnail: https://tika.apache.org/1.24.1/api/org/apache/tika/metadata/TikaCoreProperties.EmbeddedResourceType.html#THUMBNAIL
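
To confirm the thumbnail independently of Tika, one option is to list the package parts directly, since OOXML files such as .potx are ZIP archives. The following is only a minimal sketch using the JDK's java.util.zip. The class name DocPropsLister and the method listDocProps are hypothetical names chosen for illustration and are not part of Tika. Note that ZIP entry names carry no leading slash, so the part reported as "/docProps/thumbnail.jpeg" would appear as "docProps/thumbnail.jpeg".

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical helper for illustration; not part of Tika.
public class DocPropsLister {

    /** Returns the names of all ZIP entries whose path starts with "docProps/". */
    static List<String> listDocProps(Path ooxmlFile) throws IOException {
        List<String> names = new ArrayList<>();
        try (InputStream in = Files.newInputStream(ooxmlFile);
             ZipInputStream zip = new ZipInputStream(in)) {
            ZipEntry entry;
            // Walk the archive in storage order and keep only docProps parts.
            while ((entry = zip.getNextEntry()) != null) {
                if (entry.getName().startsWith("docProps/")) {
                    names.add(entry.getName());
                }
            }
        }
        return names;
    }

    public static void main(String[] args) throws IOException {
        // Assumed default path (the attachment name); pass the real path as the first argument.
        Path file = Paths.get(args.length > 0 ? args[0] : "test copy.potx");
        if (!Files.exists(file)) {
            System.err.println("File not found: " + file);
            return;
        }
        for (String name : listDocProps(file)) {
            System.out.println(name);
        }
    }
}
```

If the output includes docProps/thumbnail.jpeg, the image Tika reports is the package preview thumbnail rather than a picture placed on a slide.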


was (Author: [hidden email]):
There's a thumbnail under docProps.  If you use /rmeta or the RecursiveParserWrapper, you'll see this:

"embeddedRelationshipId": "/docProps/thumbnail.jpeg"

We _should_ update the ooxml code to tag this image as type Inline: https://tika.apache.org/1.24/api/org/apache/tika/metadata/TikaCoreProperties.EmbeddedResourceType.html#INLINE

> Detecting embedded image
> ------------------------
>
>                 Key: TIKA-3098
>                 URL: https://issues.apache.org/jira/browse/TIKA-3098
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.24
>            Reporter: suchendra
>            Priority: Minor
>         Attachments: test copy.potx
>
>
> I am trying to detect embedded images using Apache Tika. I have simple Java code that uses EmbeddedDocumentExtractor to detect embedded images.
> I cannot see any image in the file, but Tika reports an embedded image.
> I have attached the file for reference.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)