[VOTE] Apache Tika 0.9 Release Candidate #1

Mattmann, Chris A (3010)
Hi Folks,

I have posted a candidate for the Apache Tika 0.9 release. The source code
is at:

http://people.apache.org/~mattmann/apache-tika-0.9/rc1/

See the included CHANGES.txt file for details on release contents and latest
changes. The release was made using the Maven2 release plugin, according to
Jukka Zitting's notes:

http://tinyurl.com/yz2cqls

This plugin creates a Tika 0.9 tag at:

http://svn.apache.org/repos/asf/tika/tags/0.9/

And a staged M2 repository at repository.apache.org, here:

https://repository.apache.org/content/repositories/orgapachetika-061/

Please vote on releasing these packages as Apache Tika 0.9. The vote is open
for the next 72 hours. Only votes from Tika PMC are binding, but everyone
is welcome to check the release candidate and voice their approval or
disapproval. The vote passes if at least three binding +1 votes are cast.

[ ] +1 Release the packages as Apache Tika 0.9.

[ ] -1 Do not release the packages because...
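
The pass condition stated above (at least three binding +1 votes) can be sketched as a small tally. This is only an illustration, not part of any Apache tooling: the `Vote` record, the `vote_passes` helper, and the placeholder voter names are all assumptions, and the sketch models only the rule as worded in this message; it does not model how -1 votes are weighed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Vote:
    """One vote cast on the thread (hypothetical record type)."""
    voter: str     # placeholder name, not a real list member
    value: int     # +1 or -1
    binding: bool  # True only for votes from PMC members


def vote_passes(votes, required_binding=3):
    """Return True if at least `required_binding` binding +1 votes were cast.

    Only the rule stated in the vote message is modelled here;
    the handling of -1 votes is deliberately left out.
    """
    binding_plus_ones = sum(1 for v in votes if v.binding and v.value == 1)
    return binding_plus_ones >= required_binding


# Placeholder ballots: two binding +1 votes and one non-binding +1.
ballots = [
    Vote("pmc-member-a", +1, True),
    Vote("pmc-member-b", +1, True),
    Vote("community-member", +1, False),
]
print(vote_passes(ballots))  # non-binding votes do not count toward the threshold

# A third binding +1 meets the stated threshold.
ballots.append(Vote("pmc-member-c", +1, True))
print(vote_passes(ballots))
```

Non-binding votes are kept in the list rather than filtered out up front, since the message says everyone is welcome to voice approval or disapproval; they just do not count toward the threshold.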

Thanks!

Cheers,
Chris

P.S. Here's my +1.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: [hidden email]
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++



Re: [VOTE] Apache Tika 0.9 Release Candidate #1

Maxim Valyanskiy
+1

On 14.02.2011 08:09, Mattmann, Chris A (388J) wrote:

> Please vote on releasing these packages as Apache Tika 0.9.


Re: [VOTE] Apache Tika 0.9 Release Candidate #1

Jukka Zitting-2
In reply to this post by Mattmann, Chris A (3010)
Hi,

On 02/14/2011 06:09 AM, Mattmann, Chris A (388J) wrote:
> Please vote on releasing these packages as Apache Tika 0.9.

+1

I think the release is good to go as is; no need for RC #2. The
concern I raised in TIKA-596 is mostly cosmetic as Chris' earlier fix
already solved the more pressing user-visible issue.

I was also thinking of perhaps adding a client-side part to the network
parser thing I added earlier, but that's also something we can do in
time for 1.0.

--
Jukka Zitting

Re: [VOTE] Apache Tika 0.9 Release Candidate #1

Julien Nioche-4
In reply to this post by Mattmann, Chris A (3010)
> Please vote on releasing these packages as Apache Tika 0.9.

+1: I tried 0.9 with Behemoth and it worked fine. With Nutch 1.3 it
causes an issue with the zip parser plugin, but I don't think Tika is to
blame. I'll fix it when 0.9 is released.

--
Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com

Re: [VOTE] Apache Tika 0.9 Release Candidate #1

Michael McCandless-2
In reply to this post by Mattmann, Chris A (3010)
+1 to release

I built the RC on Linux (Fedora 13)... tests all passed.  Then I used
it to pull all text from Lucene in Action 2E's manuscript in 3
different forms: the "original" separate MS Word docs per-chapter, the
per-chapter production PDFs, and the final released whole-book PDF.

I did hit the same POI bug as TIKA-589
(https://issues.apache.org/bugzilla/show_bug.cgi?id=50688) on one
of the word docs, but otherwise everything ran fine and the extracted
text looks good.

Nice work everyone,

Mike

On Mon, Feb 14, 2011 at 12:09 AM, Mattmann, Chris A (388J)
<[hidden email]> wrote:

> Please vote on releasing these packages as Apache Tika 0.9.

Re: [VOTE] Apache Tika 0.9 Release Candidate #1

Nick Burch-4
On Tue, 15 Feb 2011, Michael McCandless wrote:
> I did hit the same POI bug as TIKA-589
> (https://issues.apache.org/bugzilla/show_bug.cgi?id=50688) on one of the
> word docs, but otherwise everything ran fine and the extracted text
> looks good.

There will hopefully be a new POI (beta) release in a few weeks' time. When
that's done, we can upgrade the dependency in Tika and get the fix!

Nick

Re: [VOTE] Apache Tika 0.9 Release Candidate #1

kkrugler
In reply to this post by Mattmann, Chris A (3010)

On Feb 13, 2011, at 9:09pm, Mattmann, Chris A (388J) wrote:

> Please vote on releasing these packages as Apache Tika 0.9.

+1

We've been using Tika trunk for the past few weeks, and recently  
fetched/parsed 550M pages - no problems encountered. Just FYI, these  
were limited to HTML/PDF/images.

-- Ken

--------------------------
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c   w e b   m i n i n g






Re: [VOTE] Apache Tika 0.9 Release Candidate #1

Alex Ott
In reply to this post by Michael McCandless-2
I built on Mac OS X 10.6 - everything works.

Michael McCandless  at "Tue, 15 Feb 2011 08:34:25 -0500" wrote:
 MM> +1 to release

 MM> I built the RC on Linux (Fedora 13)... tests all passed.

--
With best wishes, Alex Ott, MBA
http://alexott.blogspot.com/        http://alexott.net/
http://alexott-ru.blogspot.com/
Skype: alex.ott