[VOTE] Apache Tika 1.11 Release Candidate #1


[VOTE] Apache Tika 1.11 Release Candidate #1

Mattmann, Chris A (3010)
Hi Folks,

A first candidate for the Tika 1.11 release is available at:

  https://dist.apache.org/repos/dist/dev/tika/

The release candidate is a zip archive of the sources in:
  http://svn.apache.org/repos/asf/tika/tags/1.11-rc1/

The SHA1 checksum of the archive is
d0dde7b3a4f1a2fb6ccd741552ea180dddab630a
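
For anyone who wants to check the archive against the SHA1 above, here is a minimal sketch in plain Java. The class name `Sha1Check` and the command-line arguments are illustrative assumptions, not part of the Tika release tooling; the archive file name is whatever you downloaded from the dist directory.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper: computes the SHA-1 of a file and compares it to an expected hex string.
public class Sha1Check {

    // Streams the input through SHA-1 and returns the digest as lowercase hex.
    static String sha1Hex(InputStream in) throws IOException, NoSuchAlgorithmException {
        MessageDigest md = MessageDigest.getInstance("SHA-1");
        byte[] buf = new byte[8192];
        int n;
        while ((n = in.read(buf)) != -1) {
            md.update(buf, 0, n);
        }
        StringBuilder hex = new StringBuilder();
        for (byte b : md.digest()) {
            hex.append(String.format("%02x", b & 0xff));
        }
        return hex.toString();
    }

    // Usage (file name is an assumption): java Sha1Check <archive.zip> <expected-sha1>
    public static void main(String[] args) throws Exception {
        if (args.length != 2) {
            System.err.println("usage: java Sha1Check <archive> <expected-sha1>");
            System.exit(2);
        }
        String expected = args[1].trim().toLowerCase();
        String actual;
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            actual = sha1Hex(in);
        }
        if (actual.equals(expected)) {
            System.out.println("OK");
        } else {
            System.out.println("MISMATCH: computed " + actual);
            System.exit(1);
        }
    }
}
```

Pass the downloaded archive as the first argument and d0dde7b3a4f1a2fb6ccd741552ea180dddab630a as the second. This only checks integrity; it does not replace verifying the PGP signature.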

In addition, a staged maven repository is available here:

https://repository.apache.org/content/repositories/orgapachetika-1014/


Please vote on releasing this package as Apache Tika 1.11.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.

[ ] +1 Release this package as Apache Tika 1.11
[ ] -1 Do not release this package because…

Cheers,
Chris

P.S. Of course here is my +1.



++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: [hidden email]
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++




RE: [VOTE] Apache Tika 1.11 Release Candidate #1

Allison, Timothy B.
+0 (some regressions in ppt content)

I just finished the batch comparison run on ~1.8 million files in our govdocs1 and commoncrawl corpora, comparing Tika 1.10 to 1.11-rc1.  As a caveat, the eval code is still in development and there may be bugs in the reports.

Results are here: https://github.com/tballison/share/blob/master/tika_comparisons/tika_1_10_vs_1_11-rc1.zip 

Key reports:
contents/content_diffs.csv (file had one corrupt row when viewing in Excel...manually deleted offending content)
exceptions/newExceptionsInBByMimeTypeByStackTrace.csv (small handful)
exceptions/fixedExceptionsInBByMimeType.csv  (none!)
mimes/mime_diffs_A_to_B.csv

On the positive side:
From "mime_diffs_A_to_B.csv", it looks like we are catching more pdfs as pdfs (rather than text/xhtml) than we were...great!  We're identifying more files as images (jpeg, pict) than as xhtml, and, from a quick look, this appears to be an improvement.  We have at least 9 new x-hwp-v5 (great!).

On the negative side:

1) We have a few regressions in ppt exceptions (six of the same AIOOBE, i.e. ArrayIndexOutOfBoundsException).
2) We have regressions in ppt content (it looks like we're not adding a new line/word break where we need to).  The regressions are small per file, but they affect ~220 ppts out of ~1500 (~15%).

Other than the regressions in ppt content, I'd be +1, but I don't think this is severe enough to warrant a re-spin.  Happy to look into a fix, though, if we want a re-spin...and even if we don't, I'll start looking into this asap.
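
To make the "missing new line/word break" symptom concrete, here is a rough sketch of one way such a regression could be flagged when comparing extracted text from two versions. This is not the eval code used for the reports above; the class `MissingBreaks` and the method `fusedTokens` are hypothetical names, and the heuristic (a new token that equals two adjacent old tokens glued together) is only an assumption about what the symptom looks like.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration: find tokens in the new extraction that look like
// two adjacent tokens from the old extraction with the separator dropped.
public class MissingBreaks {

    static List<String> fusedTokens(String before, String after) {
        String[] oldTokens = before.trim().split("\\s+");
        Set<String> known = new HashSet<>(Arrays.asList(oldTokens));

        // Every pair of neighbouring old tokens, concatenated without a separator.
        Set<String> joined = new HashSet<>();
        for (int i = 0; i + 1 < oldTokens.length; i++) {
            joined.add(oldTokens[i] + oldTokens[i + 1]);
        }

        // A new token is suspicious if it was not a token before
        // but matches one of the concatenated neighbour pairs.
        List<String> suspicious = new ArrayList<>();
        for (String token : after.trim().split("\\s+")) {
            if (!known.contains(token) && joined.contains(token)) {
                suspicious.add(token);
            }
        }
        return suspicious;
    }

    public static void main(String[] args) {
        String before = "Title Slide\nBullet one";   // old output, with a line break
        String after = "Title SlideBullet one";      // new output, break lost
        System.out.println(fusedTokens(before, after));
    }
}
```

A per-file count of such tokens would be small for each file, which matches the "small per file" description, but this is only a guess at a useful signal.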

-----Original Message-----
From: Mattmann, Chris A (3980) [mailto:[hidden email]]
Sent: Monday, October 19, 2015 10:23 AM
To: [hidden email]
Cc: [hidden email]
Subject: [VOTE] Apache Tika 1.11 Release Candidate #1

Hi Folks,

A first candidate for the Tika 1.11 release is available at:

  https://dist.apache.org/repos/dist/dev/tika/


Re: [VOTE] Apache Tika 1.11 Release Candidate #1

Tyler Palsulich
+1 from me -- builds, tests pass, sanity check files parse, and sums look
good. But, I get a warning that the signature is not certified with a
trusted signature.

Tyler

On Wed, Oct 21, 2015 at 6:43 AM Allison, Timothy B. <[hidden email]>
wrote:

> +0 (some regressions in ppt content)

Re: [VOTE] Apache Tika 1.11 Release Candidate #1

Bob Paulin-2
+1 - All projects build, tests pass, OSGi bundle loads, checksums match.

- Bob

On 10/22/2015 8:49 PM, Tyler Palsulich wrote:

> +1 from me -- builds, tests pass, sanity check files parse, and sums look
> good. But, I get a warning that the signature is not certified with a
> trusted signature.
>
> Tyler

Re: [VOTE] Apache Tika 1.11 Release Candidate #1

David Meikle
In reply to this post by Mattmann, Chris A (3010)
Hello,

> On 19 Oct 2015, at 15:23, Mattmann, Chris A (3980) <[hidden email]> wrote:
>
> A first candidate for the Tika 1.11 release is available at:
>
>  https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
>  http://svn.apache.org/repos/asf/tika/tags/1.11-rc1/
>
> The SHA1 checksum of the archive is
> d0dde7b3a4f1a2fb6ccd741552ea180dddab630a
>
> In addition, a staged maven repository is available here:
>
> https://repository.apache.org/content/repositories/orgapachetika-1014/
>
>
> Please vote on releasing this package as Apache Tika 1.11.
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.11
> [ ] -1 Do not release this package because…

+1 from me. Build and tests pass on OS X and Windows. Sigs good. I get the same non-trusted signature though.

Cheers,
Dave

Re: [VOTE] Apache Tika 1.11 Release Candidate #1

Oleg Tikhonov-2
Hi guys, all looks fine on a basic setup on x86_64 Ubuntu; however, I got the
following:
Running org.apache.tika.parser.journal.JournalParserTest
25 Oct 2015 10:45:53  WARN PhaseInterceptorChain - Interceptor for {http://localhost:8080/grobid}WebClient has thrown exception, unwinding now
org.apache.cxf.interceptor.Fault: Could not send Message.
    at org.apache.cxf.interceptor.MessageSenderInterceptor$MessageSenderEndingInterceptor.handleMessage(MessageSenderInterceptor.java:64)
    at org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:307)
    at org.apache.cxf.jaxrs.client.AbstractClient.doRunInterceptorChain(AbstractClient.java:623)
    at org.apache.cxf.jaxrs.client.WebClient.doChainedInvocation(WebClient.java:1084)
    at org.apache.cxf.jaxrs.client.WebClient.doInvoke(WebClient.java:883)
    at org.apache.cxf.jaxrs.client.WebClient.doInvoke(WebClient.java:854)
    at org.apache.cxf.jaxrs.client.WebClient.invoke(WebClient.java:320)
    at org.apache.cxf.jaxrs.client.WebClient.get(WebClient.java:346)
    at org.apache.tika.parser.journal.GrobidRESTParser.canRun(GrobidRESTParser.java:102)
    at org.apache.tika.parser.journal.JournalParserTest.testJournalParser(JournalParserTest.java:39)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
    at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
    at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
    at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
    at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
    at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
    at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
    at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
    at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
    at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
    at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
    at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
    at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
    at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:283)
    at org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:173)
    at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:153)
    at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:128)
    at org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:203)
    at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:155)
    at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:103)
Caused by: java.net.ConnectException: ConnectException invoking http://localhost:8080/grobid: Connection refused
    at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
    at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
    at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
    at org.apache.cxf.transport.http.HTTPConduit$WrappedOutputStream.mapException(HTTPConduit.java:1359)
    at org.apache.cxf.transport.http.HTTPConduit$WrappedOutputStream.close(HTTPConduit.java:1343)
    at org.apache.cxf.transport.AbstractConduit.close(AbstractConduit.java:56)
    at org.apache.cxf.transport.http.HTTPConduit.close(HTTPConduit.java:638)
    at org.apache.cxf.interceptor.MessageSenderInterceptor$MessageSenderEndingInterceptor.handleMessage(MessageSenderInterceptor.java:62)
    ... 33 more
Caused by: java.net.ConnectException: Connection refused
    at java.net.PlainSocketImpl.socketConnect(Native Method)
    at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:339)
    at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:200)
    at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:182)
    at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
    at java.net.Socket.connect(Socket.java:579)
    at sun.net.NetworkClient.doConnect(NetworkClient.java:175)
    at sun.net.www.http.HttpClient.openServer(HttpClient.java:432)
    at sun.net.www.http.HttpClient.openServer(HttpClient.java:527)
    at sun.net.www.http.HttpClient.<init>(HttpClient.java:211)
    at sun.net.www.http.HttpClient.New(HttpClient.java:308)
    at sun.net.www.http.HttpClient.New(HttpClient.java:326)
    at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:997)
    at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:933)
    at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:851)
    at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1301)
    at java.net.HttpURLConnection.getResponseCode(HttpURLConnection.java:468)
    at org.apache.cxf.transport.http.URLConnectionHTTPConduit$URLConnectionWrappedOutputStream.getResponseCode(URLConnectionHTTPConduit.java:266)
    at org.apache.cxf.transport.http.HTTPConduit$WrappedOutputStream.handleResponseInternal(HTTPConduit.java:1557)
    at org.apache.cxf.transport.http.HTTPConduit$WrappedOutputStream.handleResponse(HTTPConduit.java:1527)
    at org.apache.cxf.transport.http.HTTPConduit$WrappedOutputStream.close(HTTPConduit.java:1330)
    ... 36 more

It seems the web app was not running during the test.
Apart from that, all seems good.

+0.

BR,
Oleg

On Sat, Oct 24, 2015 at 3:22 PM, David Meikle <[hidden email]> wrote:

> +1 from me. Build and tests pass on OS X and Windows. Sigs good. I get the
> same non-trusted signature though.
>
> Cheers,
> Dave

Re: [VOTE] Apache Tika 1.11 Release Candidate #1

Mattmann, Chris A (3010)
Thanks Oleg,

As long as the test was skipped, it’s fine. Thanks!
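
For context, a minimal sketch of the kind of pre-check that lets a test skip itself when an external service is down. This is not the actual GrobidRESTParser.canRun logic (which, per the stack trace, goes through a CXF WebClient); the class `ServiceProbe` and the method `isReachable` are hypothetical, and the host, port, and timeout values are assumptions based on the localhost:8080 URL in the log.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

// Hypothetical helper: checks whether something is listening on host:port.
public class ServiceProbe {

    // Returns true if a TCP connection can be opened within the timeout,
    // false on "connection refused", timeout, or any other I/O failure.
    static boolean isReachable(String host, int port, int timeoutMillis) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMillis);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Port 8080 on localhost matches the endpoint in the log; the timeout is arbitrary.
        if (!isReachable("localhost", 8080, 500)) {
            System.out.println("Service not reachable; skipping test");
            return;
        }
        System.out.println("Service reachable; running test");
    }
}
```

A test guarded this way logs nothing alarming when the service is absent; the WARN and stack trace in the report come from the HTTP client logging the refused connection before the check returns.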






-----Original Message-----
From: <[hidden email]> on behalf of Oleg Tikhonov <[hidden email]>
Reply-To: "[hidden email]" <[hidden email]>
Date: Sunday, October 25, 2015 at 2:01 AM
To: "[hidden email]" <[hidden email]>
Cc: "[hidden email]" <[hidden email]>
Subject: Re: [VOTE] Apache Tika 1.11 Release Candidate #1

>Hi guys, all looks fine on basic set up in x86_64 Ubuntu, however I got
>the following:
>Running org.apache.tika.parser.journal.JournalParserTest
>25 Oct 2015 10:45:53  WARN PhaseInterceptorChain - Interceptor for
>{http://localhost:8080/grobid}WebClient has thrown exception, unwinding
>now
>org.apache.cxf.interceptor.Fault: Could not send Message.
>
>Seems like in the test web app was not running.
>
>Except that, all seems good.
>
>+0.
>
>
>BR,
>
>Oleg