[jira] Created: (TIKA-241) Rar archive support

JIRA jira@apache.org
Rar archive support
-------------------

                 Key: TIKA-241
                 URL: https://issues.apache.org/jira/browse/TIKA-241
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 0.3
            Reporter: Jan Goyvaerts
            Priority: Minor


Support for parsing .rar files.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (TIKA-241) Rar archive support

JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jan Goyvaerts updated TIKA-241:
-------------------------------

    Attachment: innosystec.rar

java-unrar jar and pom file.

The jar file is from the java-unrar homepage: http://sourceforge.net/projects/java-unrar.

It should be deployed in the Maven repository.

[jira] Updated: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jan Goyvaerts updated TIKA-241:
-------------------------------

    Attachment: Tika-rar.zip

Maven (NetBeans) project containing the parser, test code, and configuration.

[jira] Commented: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12716039#action_12716039 ]

Otis Gospodnetic commented on TIKA-241:
---------------------------------------

But it looks like that jar might be LGPLed, no?


[jira] Commented: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12716066#action_12716066 ]

Jan Goyvaerts commented on TIKA-241:
------------------------------------

It looks like it is indeed LGPL (http://sourceforge.net/projects/java-unrar).

I'm not familiar with every practical implication of every licensing scheme,
but if I'm correct, LGPL means that the library can be used for both open
source and commercial purposes, without the obligation to open source your own
application. Am I right?

[jira] Commented: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12716075#action_12716075 ]

Jukka Zitting commented on TIKA-241:
------------------------------------

The java-unrar-0.2.zip package comes with a fairly permissive license.txt file, though it might be a good idea to contact the author to clarify the licensing terms.

[jira] Commented: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12716181#action_12716181 ]

Jan Goyvaerts commented on TIKA-241:
------------------------------------

Do you want me to talk to the guys? Or is this for the 'legal' Tika department? :-)

[jira] Commented: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12716429#action_12716429 ]

Otis Gospodnetic commented on TIKA-241:
---------------------------------------

Ideally you'd talk to them and ask them to consider releasing the library under ASL v2.  Or maybe there already is something like that out there?


[jira] Commented: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12716585#action_12716585 ]

Jan Goyvaerts commented on TIKA-241:
------------------------------------

Doesn't LGPL already allow the use of java-unrar in Tika? So why should
they change their license?

[jira] Commented: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12716629#action_12716629 ]

Otis Gospodnetic commented on TIKA-241:
---------------------------------------

No, I believe we cannot have copies of LGPL software in Apache.


[jira] Commented: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12717177#action_12717177 ]

Steen Manniche commented on TIKA-241:
-------------------------------------

This is true; please see http://www.apache.org/licenses/GPL-compatibility.html for details.

[jira] Commented: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718337#action_12718337 ]

Jan Goyvaerts commented on TIKA-241:
------------------------------------

Here is the reply from Edmund, the developer of the java-unrar library:

Hello,
Unfortunately, the unrar license applies to most of the junrar code. It does
not allow rebuilding the RAR compression algorithm; the decompression is open
source and can be (and was) used (see licence.txt). The parts that are
independent of unrar are LGPL, but they are difficult to separate.

Best,
Edmund

[jira] Commented: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12724993#action_12724993 ]

Jukka Zitting commented on TIKA-241:
------------------------------------

My quick reading of the unrar license suggests that it would be acceptable as an Apache dependency. Instead of the partial LGPL licensing, would the java-unrar author be willing to license the entire library under the unrar license?

[jira] Updated: (TIKA-241) Rar archive support

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting updated TIKA-241:
-------------------------------

    Attachment: java-unrar-src.zip

Jan asked for and got a relicensed version of the java-unrar library. I'm attaching it here for the record.

I filed LEGAL-52 to get official clearance on the license terms.
