[jira] Created: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

classic Classic list List threaded Threaded
18 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
Upgrade nutch to use released apache-tika-0.1-incubating
--------------------------------------------------------

                 Key: NUTCH-608
                 URL: https://issues.apache.org/jira/browse/NUTCH-608
             Project: Nutch
          Issue Type: Improvement
          Components: mime_type_detector
            Reporter: Chris A. Mattmann
            Assignee: Chris A. Mattmann
             Fix For: 1.0.0


This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567384#action_12567384 ]

Chris A. Mattmann commented on NUTCH-608:
-----------------------------------------

If there are no objections, I'd like to commit this patch within the next 24 hrs.

Thanks,
 Chris


> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Work started: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on NUTCH-608 started by Chris A. Mattmann.

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567395#action_12567395 ]

Chris A. Mattmann commented on NUTCH-608:
-----------------------------------------

Sorry folks, the patch didn't go through the first time, just noticed this. Will attach now.

Also, I'll extend the open time for patch review from 24 hrs to let's say a week. While upgrading to the released version of Tika, I noticed that I had to update the APIs in a few key places. It seems to be passing basic crawls, and unit tests right now, but I'd be much more happy if someone with a great test bed like Dennis (or others) tries the patch out and provides feedback.

Thanks,
 Chris


> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann updated NUTCH-608:
------------------------------------

    Attachment: tika-0.1-incubating.jar

apache tika 0.1-incubating

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann updated NUTCH-608:
------------------------------------

    Attachment: NUTCH-608.Mattmann.021008.patch.txt

Initial patch, horrendously late :)

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567665#action_12567665 ]

Andrzej Bialecki  commented on NUTCH-608:
-----------------------------------------

This patch includes many whitespace-only changes. It's better to submit such changes separately, because they cause the patch to be much larger than necessary and difficult to review.

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann updated NUTCH-608:
------------------------------------

    Attachment: NUTCH-608.Mattmann.021108.patch.txt

- updated patch, removes unintentional white space changes.

Thanks for the review, Andrzej!

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, NUTCH-608.Mattmann.021108.patch.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567706#action_12567706 ]

Andrzej Bialecki  commented on NUTCH-608:
-----------------------------------------

One additional comment, now that the important changes are visible ;) Since you add a util-type class anyway (MimeUtils), why not encapsulate all interactions with Tika inside this class? This way we can protect the rest of Nutch code from future changes in Tika API, and we can avoid adding Tika imports to various classes ...

The class could be patterned after many other similar classes, having a constructor like MimeUtils(Configuration conf). Then it can wrap all the initialization code, string splitting and the fallback strategies without exposing any Tika classes.

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, NUTCH-608.Mattmann.021108.patch.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann updated NUTCH-608:
------------------------------------

    Attachment: NUTCH-608.Mattmann.021108.patch.v2.txt

Hi Andrzej:

Good idea. The facade interface is attached, along with all the appropriate hooks in the latest patch. Comments?

Cheers,
 Chris


> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, NUTCH-608.Mattmann.021108.patch.txt, NUTCH-608.Mattmann.021108.patch.v2.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567726#action_12567726 ]

Andrzej Bialecki  commented on NUTCH-608:
-----------------------------------------

Looks great. +1

However, this patch uncovered two minor bugs - the use of "static" keyword in MoreIndexingFilter, ZipExtractor and FileResponse. IMO this should never be static, because MIME attribute may be initialized differently depending on the current Configuration. The bugs are minor because we reuse the same JVM only in the "local" mode, where jobs are started usually with the same Configuration. We can track these bugs in a separate issue if you prefer.

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, NUTCH-608.Mattmann.021108.patch.txt, NUTCH-608.Mattmann.021108.patch.v2.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann updated NUTCH-608:
------------------------------------

    Attachment: NUTCH-608.Mattmann.021108.patch.v3.txt

Hi Andrzej,

Thanks for your comments. I've removed the static keywords, and attached an updated patch. Does this address all of your concerns?

Furthermore, if it does, are there any other objections from any of the committers, and can I go ahead and commit this patch?

Thanks,
 Chris


> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, NUTCH-608.Mattmann.021108.patch.txt, NUTCH-608.Mattmann.021108.patch.v2.txt, NUTCH-608.Mattmann.021108.patch.v3.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567749#action_12567749 ]

Andrzej Bialecki  commented on NUTCH-608:
-----------------------------------------

You missed one in Content ... other than that, +1.

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, NUTCH-608.Mattmann.021108.patch.txt, NUTCH-608.Mattmann.021108.patch.v2.txt, NUTCH-608.Mattmann.021108.patch.v3.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann updated NUTCH-608:
------------------------------------

    Attachment: NUTCH-608.Mattmann.021108.patch.v4.txt

For completeness sake, an attached patch with the missed Content static ref taken out.

If there are no further objections, I'd like to commit this sometime in the next 24 hrs.

Thanks for the thorough review Andrzej! :)

It feels good to do some Nutch development again ;)

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, NUTCH-608.Mattmann.021108.patch.txt, NUTCH-608.Mattmann.021108.patch.v2.txt, NUTCH-608.Mattmann.021108.patch.v3.txt, NUTCH-608.Mattmann.021108.patch.v4.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567823#action_12567823 ]

Dennis Kubes commented on NUTCH-608:
------------------------------------

+1 looks good.

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, NUTCH-608.Mattmann.021108.patch.txt, NUTCH-608.Mattmann.021108.patch.v2.txt, NUTCH-608.Mattmann.021108.patch.v3.txt, NUTCH-608.Mattmann.021108.patch.v4.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Resolved: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann resolved NUTCH-608.
-------------------------------------

    Resolution: Fixed

- added MimeUtil facade class to insulate Nutch from underlying mime type detector, Tika
- cleaned up static refs to MimeUtil/MimeType in Content/index-more/parse-zip/protocol-file
- upgrade to tika-0.1-incubating


> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, NUTCH-608.Mattmann.021108.patch.txt, NUTCH-608.Mattmann.021108.patch.v2.txt, NUTCH-608.Mattmann.021108.patch.v3.txt, NUTCH-608.Mattmann.021108.patch.v4.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Closed: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann closed NUTCH-608.
-----------------------------------


- Patch applied to trunk:

http://svn.apache.org/viewvc?rev=620811&view=rev

Thanks for the reviews, everyone!

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, NUTCH-608.Mattmann.021108.patch.txt, NUTCH-608.Mattmann.021108.patch.v2.txt, NUTCH-608.Mattmann.021108.patch.v3.txt, NUTCH-608.Mattmann.021108.patch.v4.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12568421#action_12568421 ]

Hudson commented on NUTCH-608:
------------------------------

Integrated in Nutch-trunk #360 (See [http://hudson.zones.apache.org/hudson/job/Nutch-trunk/360/])

> Upgrade nutch to use released apache-tika-0.1-incubating
> --------------------------------------------------------
>
>                 Key: NUTCH-608
>                 URL: https://issues.apache.org/jira/browse/NUTCH-608
>             Project: Nutch
>          Issue Type: Improvement
>          Components: mime_type_detector
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-608.Mattmann.021008.patch.txt, NUTCH-608.Mattmann.021108.patch.txt, NUTCH-608.Mattmann.021108.patch.v2.txt, NUTCH-608.Mattmann.021108.patch.v3.txt, NUTCH-608.Mattmann.021108.patch.v4.txt, tika-0.1-incubating.jar
>
>
> This patch will upgrade Nutch to use the released tika-0.1-incubating jar containing stable APIs and code, as opposed to the -dev version of the jar file that's currently in place in SVN.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.