[jira] [Commented] (TIKA-3104) Detection of memgraph files exported from Xcode

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view

[jira] [Commented] (TIKA-3104) Detection of memgraph files exported from Xcode

Mihir Sharma (Jira)

    [ https://issues.apache.org/jira/browse/TIKA-3104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17123801#comment-17123801 ]

Tim Allison commented on TIKA-3104:

[~Tamane] I have no idea if I was decoding it properly because I don't understand what the binary is encoding. I did call Tika's AutoDetectParser on the data.getBytes(), but that was useless, because as you point out this is some proprietary graph info.

I would encourage downloading url content in a separate process than running Tika.  If you must do both at once, try something like this.

        Parser p = new AutoDetectParser();
        Metadata m = new Metadata();
        URL url = new URL("https://issues.apache.org/jira/secure/attachment/13004188/DeepScroll_Example%5B4988%5D.memgraph");
        ContentHandler contentHandler = new ToXMLContentHandler();
        try (TikaInputStream tis = TikaInputStream.get(url)) {
            p.parse(tis, contentHandler, m, new ParseContext());

> Detection of memgraph files exported from Xcode
> -----------------------------------------------
>                 Key: TIKA-3104
>                 URL: https://issues.apache.org/jira/browse/TIKA-3104
>             Project: Tika
>          Issue Type: Wish
>          Components: core
>    Affects Versions: 1.24
>            Reporter: Parth
>            Assignee: Tim Allison
>            Priority: Major
>              Labels: detection, features, new-parser
>         Attachments: DeepScroll_Example[4988].memgraph, memgraph.xml, out.memgraph.json, out.memgraph.xhtml
> I wanted to detect a memgraph file linked by a url. But currently detection of memgraph file is not supported. I tried adding to custom-mimetypes but that did not help.  
> <mime-info>
>  <mime-type type="application/memgraph">
>  <glob pattern="*.memgraph"/>
>  </mime-type>
> </mime-info>

This message was sent by Atlassian Jira