More issues with top-level build for Tika 1.25 rc1 - Waited more than 5 minutes for a SAXParser

classic Classic list List threaded Threaded
6 messages Options
Reply | Threaded
Open this post in threaded view
|

More issues with top-level build for Tika 1.25 rc1 - Waited more than 5 minutes for a SAXParser

kkrugler
Hi all,

I got past the JCE issue, but now some tests are failing with timeouts.

For this test:

[INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest

I get 100s of these warnings:

Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE

And then:

[ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: 318.512 s <<< FAILURE! - in org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
[ERROR] org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint  Time elapsed: 308.223 s  <<< ERROR!
org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
        at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email] <mailto:[hidden email]>
        at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
Caused by: org.apache.tika.exception.TikaException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email] <mailto:[hidden email]>
        at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)

Similarly, for:

[INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest

Many of these:

Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE

And then similarly:

[ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: 309.375 s <<< FAILURE! - in org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
[ERROR] org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint  Time elapsed: 307.9 s  <<< ERROR!
org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
        at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email] <mailto:[hidden email]>
        at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
Caused by: org.apache.tika.exception.TikaException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email] <mailto:[hidden email]>
        at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)

And now:

[INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
[INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
[INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
[WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
[INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE
Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE
Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE
… and so on…

Any suggestions?

Thanks!

— Ken

--------------------------
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr

Reply | Threaded
Open this post in threaded view
|

Re: More issues with top-level build for Tika 1.25 rc1 - Waited more than 5 minutes for a SAXParser

Tim Allison
Ken,
  Thank you for finding this and sharing it.  I haven't seen this on my mac
or ubuntu...not denying what you are seeing!

  Are you able to build 1.24.1 with no problem?  I wonder if your system is
using a different SAXParser which is not handled correctly in
XMLReaderUtils?  What OS, what version of java?

  Thank you, again.

      Best,

              Tim

On Mon, Nov 23, 2020 at 1:40 PM Ken Krugler <[hidden email]>
wrote:

> Hi all,
>
> I got past the JCE issue, but now some tests are failing with timeouts.
>
> For this test:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>
> I get 100s of these warnings:
>
> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
>
> And then:
>
> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
> 318.512 s <<< FAILURE! - in
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
> [ERROR]
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint
> Time elapsed: 308.223 s  <<< ERROR!
> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> SAXParser; This could indicate that a parser has not correctly released its
> SAXParser. Please report this to the Tika team: [hidden email]
> <mailto:[hidden email]>
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> minutes for a SAXParser; This could indicate that a parser has not
> correctly released its SAXParser. Please report this to the Tika team:
> [hidden email] <mailto:[hidden email]>
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>
> Similarly, for:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>
> Many of these:
>
> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
>
> And then similarly:
>
> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
> 309.375 s <<< FAILURE! - in
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
> [ERROR]
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint
> Time elapsed: 307.9 s  <<< ERROR!
> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> SAXParser; This could indicate that a parser has not correctly released its
> SAXParser. Please report this to the Tika team: [hidden email]
> <mailto:[hidden email]>
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> minutes for a SAXParser; This could indicate that a parser has not
> correctly released its SAXParser. Please report this to the Tika team:
> [hidden email] <mailto:[hidden email]>
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>
> And now:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed:
> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> … and so on…
>
> Any suggestions?
>
> Thanks!
>
> — Ken
>
> --------------------------
> Ken Krugler
> http://www.scaleunlimited.com
> custom big data solutions & training
> Hadoop, Cascading, Cassandra & Solr
>
>
Reply | Threaded
Open this post in threaded view
|

Re: More issues with top-level build for Tika 1.25 rc1 - Waited more than 5 minutes for a SAXParser

kkrugler
Hi Tim,

> On Nov 23, 2020, at 10:46 AM, Tim Allison <[hidden email]> wrote:
>
> Ken,
>  Thank you for finding this and sharing it.  I haven't seen this on my mac
> or ubuntu...not denying what you are seeing!
>
>  Are you able to build 1.24.1 with no problem?  

Trying that now…

> I wonder if your system is
> using a different SAXParser which is not handled correctly in
> XMLReaderUtils?  What OS, what version of java?

Mac OS 10.12.6

java version "1.8.0_131"
Java(TM) SE Runtime Environment (build 1.8.0_131-b11)
Java HotSpot(TM) 64-Bit Server VM (build 25.131-b11, mixed mode)

— Ken

> On Mon, Nov 23, 2020 at 1:40 PM Ken Krugler <[hidden email]>
> wrote:
>
>> Hi all,
>>
>> I got past the JCE issue, but now some tests are failing with timeouts.
>>
>> For this test:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>>
>> I get 100s of these warnings:
>>
>> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>>
>> And then:
>>
>> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
>> 318.512 s <<< FAILURE! - in
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>> [ERROR]
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint
>> Time elapsed: 308.223 s  <<< ERROR!
>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
>> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
>> SAXParser; This could indicate that a parser has not correctly released its
>> SAXParser. Please report this to the Tika team: [hidden email]
>> <mailto:[hidden email]>
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
>> minutes for a SAXParser; This could indicate that a parser has not
>> correctly released its SAXParser. Please report this to the Tika team:
>> [hidden email] <mailto:[hidden email]>
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>>
>> Similarly, for:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>>
>> Many of these:
>>
>> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>>
>> And then similarly:
>>
>> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
>> 309.375 s <<< FAILURE! - in
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>> [ERROR]
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint
>> Time elapsed: 307.9 s  <<< ERROR!
>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
>> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
>> SAXParser; This could indicate that a parser has not correctly released its
>> SAXParser. Please report this to the Tika team: [hidden email]
>> <mailto:[hidden email]>
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
>> minutes for a SAXParser; This could indicate that a parser has not
>> correctly released its SAXParser. Please report this to the Tika team:
>> [hidden email] <mailto:[hidden email]>
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>>
>> And now:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
>> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
>> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed:
>> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
>> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> … and so on…
>>
>> Any suggestions?
>>
>> Thanks!
>>
>> — Ken
>>
>> --------------------------
>> Ken Krugler
>> http://www.scaleunlimited.com
>> custom big data solutions & training
>> Hadoop, Cascading, Cassandra & Solr
>>
>>

--------------------------
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr

Reply | Threaded
Open this post in threaded view
|

Re: More issues with top-level build for Tika 1.25 rc1 - Waited more than 5 minutes for a SAXParser

kkrugler
In reply to this post by Tim Allison
Hi Tim,

I was able to build 1.24.1 without any problems.

I think tried “mvn clean package” on 1.25-rc1, and this time it made it past the parsers, but failed on OSGi.

But what’s weird is there wasn’t any error reported for that part of the build:

[INFO] ------------------------------------------------------------------------
[INFO] Building Apache Tika OSGi bundle 1.25
[INFO] ------------------------------------------------------------------------
Downloading: https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.pom
Downloaded: https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.pom (1.4 kB at 7.1 kB/s)
Downloading: https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.jar
Downloaded: https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.jar (55 kB at 951 kB/s)
[INFO]
[INFO] --- maven-clean-plugin:2.5:clean (default-clean) @ tika-bundle ---
[INFO] Deleting /Users/kenkrugler/git/tika/tika-bundle/target
[INFO]
[INFO] --- ossindex-maven-plugin:3.1.0:audit (audit-dependencies) @ tika-bundle ---
[INFO] Checking for vulnerabilities; 153 artifacts
[INFO] Exclude coordinates: [junit:junit:4.13.1]
[INFO] Exclude vulnerability identifiers: []
[INFO] CVSS-score threshold: 0.0
[WARNING] Excluding coordinates: junit:junit:4.13.1
[INFO]
[INFO] --- maven-enforcer-plugin:3.0.0-M3:enforce (enforce) @ tika-bundle ---
[INFO]
[INFO] --- maven-remote-resources-plugin:1.5:process (default) @ tika-bundle ---
[INFO]
[INFO] --- maven-resources-plugin:2.7:resources (default-resources) @ tika-bundle ---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] Copying 1 resource
[INFO] Copying 3 resources
[INFO]
[INFO] --- maven-compiler-plugin:3.8.0:compile (default-compile) @ tika-bundle ---
[INFO] No sources to compile
[INFO]
[INFO] --- maven-resources-plugin:2.7:testResources (default-testResources) @ tika-bundle ---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] Copying 4 resources
[INFO] Copying 3 resources
[INFO]
[INFO] --- maven-compiler-plugin:3.8.0:testCompile (default-testCompile) @ tika-bundle ---
[INFO] Changes detected - recompiling the module!
[INFO] Compiling 1 source file to /Users/kenkrugler/git/tika/tika-bundle/target/test-classes
[INFO] /Users/kenkrugler/git/tika/tika-bundle/src/test/java/org/apache/tika/bundle/BundleIT.java: /Users/kenkrugler/git/tika/tika-bundle/src/test/java/org/apache/tika/bundle/BundleIT.java uses or overrides a deprecated API.
[INFO] /Users/kenkrugler/git/tika/tika-bundle/src/test/java/org/apache/tika/bundle/BundleIT.java: Recompile with -Xlint:deprecation for details.
[INFO]
[INFO] --- maven-surefire-plugin:3.0.0-M4:test (default-test) @ tika-bundle ---
[INFO]
[INFO] --- maven-dependency-plugin:2.8:unpack-dependencies (default) @ tika-bundle ---
[INFO]
[INFO] --- maven-bundle-plugin:5.1.1:bundle (default-bundle) @ tika-bundle ---
[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary:
[INFO]
[INFO] Apache Tika parent ................................. SUCCESS [  1.703 s]
[INFO] Apache Tika core ................................... SUCCESS [ 35.949 s]
[INFO] Apache Tika parsers ................................ SUCCESS [04:33 min]
[INFO] Apache Tika OSGi bundle ............................ FAILURE [12:51 min]

It hangs for about 10 minutes after printing out the “[INFO] — maven-bundle-plugin:5.1.1:bundle (default-bundle) @ tika-bundle ---” line.

 So…any ideas about this? I’d seen some discussion about maven versioning being an issue recently? I’ve got:

Apache Maven 3.5.0 (ff8f5e7444045639af65f6095c62210b5713f426; 2017-04-03T12:39:06-07:00)
Maven home: /Users/kenkrugler/Tools/apache-maven
Java version: 1.8.0_131, vendor: Oracle Corporation
Java home: /Library/Java/JavaVirtualMachines/jdk1.8.0_131.jdk/Contents/Home/jre
Default locale: en_US, platform encoding: UTF-8
OS name: "mac os x", version: "10.12.6", arch: "x86_64", family: "mac"

Thanks,

— Ken


> On Nov 23, 2020, at 10:46 AM, Tim Allison <[hidden email]> wrote:
>
> Ken,
>  Thank you for finding this and sharing it.  I haven't seen this on my mac
> or ubuntu...not denying what you are seeing!
>
>  Are you able to build 1.24.1 with no problem?  I wonder if your system is
> using a different SAXParser which is not handled correctly in
> XMLReaderUtils?  What OS, what version of java?
>
>  Thank you, again.
>
>      Best,
>
>              Tim
>
> On Mon, Nov 23, 2020 at 1:40 PM Ken Krugler <[hidden email]>
> wrote:
>
>> Hi all,
>>
>> I got past the JCE issue, but now some tests are failing with timeouts.
>>
>> For this test:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>>
>> I get 100s of these warnings:
>>
>> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>>
>> And then:
>>
>> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
>> 318.512 s <<< FAILURE! - in
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>> [ERROR]
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint
>> Time elapsed: 308.223 s  <<< ERROR!
>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
>> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
>> SAXParser; This could indicate that a parser has not correctly released its
>> SAXParser. Please report this to the Tika team: [hidden email]
>> <mailto:[hidden email]>
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
>> minutes for a SAXParser; This could indicate that a parser has not
>> correctly released its SAXParser. Please report this to the Tika team:
>> [hidden email] <mailto:[hidden email]>
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>>
>> Similarly, for:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>>
>> Many of these:
>>
>> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>>
>> And then similarly:
>>
>> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
>> 309.375 s <<< FAILURE! - in
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>> [ERROR]
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint
>> Time elapsed: 307.9 s  <<< ERROR!
>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
>> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
>> SAXParser; This could indicate that a parser has not correctly released its
>> SAXParser. Please report this to the Tika team: [hidden email]
>> <mailto:[hidden email]>
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
>> minutes for a SAXParser; This could indicate that a parser has not
>> correctly released its SAXParser. Please report this to the Tika team:
>> [hidden email] <mailto:[hidden email]>
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>>
>> And now:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
>> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
>> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed:
>> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
>> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> … and so on…
>>
>> Any suggestions?
>>
>> Thanks!
>>
>> — Ken
>>
>> --------------------------
>> Ken Krugler
>> http://www.scaleunlimited.com
>> custom big data solutions & training
>> Hadoop, Cascading, Cassandra & Solr
>>
>>

--------------------------
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr

Reply | Threaded
Open this post in threaded view
|

Re: More issues with top-level build for Tika 1.25 rc1 - Waited more than 5 minutes for a SAXParser

Tim Allison
I'm still having the can't replicate blues...  Again, I completely trust
your findings...argh...

[*INFO*] *BUILD SUCCESS*

[*INFO*]
*------------------------------------------------------------------------*

[*INFO*] Total time: 11:49 min

[*INFO*] Finished at: 2020-11-23T17:39:28-05:00

[*INFO*] Final Memory: 630M/3162M


Maven home: /Users/allison/Downloads/mvn/apache-maven-3.5.0

Java version: 1.8.0_131, vendor: Oracle Corporation

Java home:
/Library/Java/JavaVirtualMachines/jdk1.8.0_131.jdk/Contents/Home/jre

Default locale: en_US, platform encoding: UTF-8

OS name: "mac os x", version: "10.14.6", arch: "x86_64", family: "mac"

On Mon, Nov 23, 2020 at 5:04 PM Ken Krugler <[hidden email]>
wrote:

> Hi Tim,
>
> I was able to build 1.24.1 without any problems.
>
> I think tried “mvn clean package” on 1.25-rc1, and this time it made it
> past the parsers, but failed on OSGi.
>
> But what’s weird is there wasn’t any error reported for that part of the
> build:
>
> [*INFO*]
> *------------------------------------------------------------------------*
> [*INFO*] *Building Apache Tika OSGi bundle 1.25*
> [*INFO*]
> *------------------------------------------------------------------------*
> Downloading:
> https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.pom
> Downloaded:
> https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.pom
> (1.4 kB at 7.1 kB/s)
> Downloading:
> https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.jar
> Downloaded:
> https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.jar
> (55 kB at 951 kB/s)
> [*INFO*]
> [*INFO*] *--- *maven-clean-plugin:2.5:clean *(default-clean)* @
> tika-bundle* ---*
> [*INFO*] Deleting /Users/kenkrugler/git/tika/tika-bundle/target
> [*INFO*]
> [*INFO*] *--- *ossindex-maven-plugin:3.1.0:audit *(audit-dependencies)* @
> tika-bundle* ---*
> [*INFO*] Checking for vulnerabilities; 153 artifacts
> [*INFO*] Exclude coordinates: [junit:junit:4.13.1]
> [*INFO*] Exclude vulnerability identifiers: []
> [*INFO*] CVSS-score threshold: 0.0
> [*WARNING*] Excluding coordinates: junit:junit:4.13.1
> [*INFO*]
> [*INFO*] *--- *maven-enforcer-plugin:3.0.0-M3:enforce *(enforce)* @
> tika-bundle* ---*
> [*INFO*]
> [*INFO*] *--- *maven-remote-resources-plugin:1.5:process *(default)* @
> tika-bundle* ---*
> [*INFO*]
> [*INFO*] *--- *maven-resources-plugin:2.7:resources *(default-resources)*
> @ tika-bundle* ---*
> [*INFO*] Using 'UTF-8' encoding to copy filtered resources.
> [*INFO*] Copying 1 resource
> [*INFO*] Copying 3 resources
> [*INFO*]
> [*INFO*] *--- *maven-compiler-plugin:3.8.0:compile *(default-compile)* @
> tika-bundle* ---*
> [*INFO*] No sources to compile
> [*INFO*]
> [*INFO*] *--- *maven-resources-plugin:2.7:testResources
> *(default-testResources)* @ tika-bundle* ---*
> [*INFO*] Using 'UTF-8' encoding to copy filtered resources.
> [*INFO*] Copying 4 resources
> [*INFO*] Copying 3 resources
> [*INFO*]
> [*INFO*] *--- *maven-compiler-plugin:3.8.0:testCompile
> *(default-testCompile)* @ tika-bundle* ---*
> [*INFO*] Changes detected - recompiling the module!
> [*INFO*] Compiling 1 source file to
> /Users/kenkrugler/git/tika/tika-bundle/target/test-classes
> [*INFO*]
> /Users/kenkrugler/git/tika/tika-bundle/src/test/java/org/apache/tika/bundle/BundleIT.java:
> /Users/kenkrugler/git/tika/tika-bundle/src/test/java/org/apache/tika/bundle/BundleIT.java
> uses or overrides a deprecated API.
> [*INFO*]
> /Users/kenkrugler/git/tika/tika-bundle/src/test/java/org/apache/tika/bundle/BundleIT.java:
> Recompile with -Xlint:deprecation for details.
> [*INFO*]
> [*INFO*] *--- *maven-surefire-plugin:3.0.0-M4:test *(default-test)* @
> tika-bundle* ---*
> [*INFO*]
> [*INFO*] *--- *maven-dependency-plugin:2.8:unpack-dependencies *(default)*
> @ tika-bundle* ---*
> [*INFO*]
> [*INFO*] *--- *maven-bundle-plugin:5.1.1:bundle *(default-bundle)* @
> tika-bundle* ---*
> [*INFO*]
> *------------------------------------------------------------------------*
> [*INFO*] *Reactor Summary:*
> [*INFO*]
> [*INFO*] Apache Tika parent ................................. *SUCCESS*
> [  1.703 s]
> [*INFO*] Apache Tika core ................................... *SUCCESS* [
> 35.949 s]
> [*INFO*] Apache Tika parsers ................................ *SUCCESS*
> [04:33 min]
> [*INFO*] Apache Tika OSGi bundle ............................ *FAILURE*
> [12:51 min]
>
> It hangs for about 10 minutes after printing out the “[INFO] —
> maven-bundle-plugin:5.1.1:bundle (default-bundle) @ tika-bundle ---” line.
>
>  So…any ideas about this? I’d seen some discussion about maven versioning
> being an issue recently? I’ve got:
>
> *Apache Maven 3.5.0 (ff8f5e7444045639af65f6095c62210b5713f426;
> 2017-04-03T12:39:06-07:00)*
> Maven home: /Users/kenkrugler/Tools/apache-maven
> Java version: 1.8.0_131, vendor: Oracle Corporation
> Java home:
> /Library/Java/JavaVirtualMachines/jdk1.8.0_131.jdk/Contents/Home/jre
> Default locale: en_US, platform encoding: UTF-8
> OS name: "mac os x", version: "10.12.6", arch: "x86_64", family: "mac"
>
> Thanks,
>
> — Ken
>
>
> On Nov 23, 2020, at 10:46 AM, Tim Allison <[hidden email]> wrote:
>
> Ken,
>  Thank you for finding this and sharing it.  I haven't seen this on my mac
> or ubuntu...not denying what you are seeing!
>
>  Are you able to build 1.24.1 with no problem?  I wonder if your system is
> using a different SAXParser which is not handled correctly in
> XMLReaderUtils?  What OS, what version of java?
>
>  Thank you, again.
>
>      Best,
>
>              Tim
>
> On Mon, Nov 23, 2020 at 1:40 PM Ken Krugler <[hidden email]>
> wrote:
>
> Hi all,
>
> I got past the JCE issue, but now some tests are failing with timeouts.
>
> For this test:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>
> I get 100s of these warnings:
>
> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
>
> And then:
>
> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
> 318.512 s <<< FAILURE! - in
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
> [ERROR]
>
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint
> Time elapsed: 308.223 s  <<< ERROR!
> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>        at
>
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> SAXParser; This could indicate that a parser has not correctly released its
> SAXParser. Please report this to the Tika team: [hidden email]
> <mailto:[hidden email] <[hidden email]>>
>        at
>
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> minutes for a SAXParser; This could indicate that a parser has not
> correctly released its SAXParser. Please report this to the Tika team:
> [hidden email] <mailto:[hidden email] <[hidden email]>>
>        at
>
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>
> Similarly, for:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>
> Many of these:
>
> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
>
> And then similarly:
>
> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
> 309.375 s <<< FAILURE! - in
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
> [ERROR]
>
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint
> Time elapsed: 307.9 s  <<< ERROR!
> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>        at
>
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> SAXParser; This could indicate that a parser has not correctly released its
> SAXParser. Please report this to the Tika team: [hidden email]
> <mailto:[hidden email] <[hidden email]>>
>        at
>
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> minutes for a SAXParser; This could indicate that a parser has not
> correctly released its SAXParser. Please report this to the Tika team:
> [hidden email] <mailto:[hidden email] <[hidden email]>>
>        at
>
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>
> And now:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed:
> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> … and so on…
>
> Any suggestions?
>
> Thanks!
>
> — Ken
>
> --------------------------
> Ken Krugler
> http://www.scaleunlimited.com
> custom big data solutions & training
> Hadoop, Cascading, Cassandra & Solr
>
>
>
> --------------------------
> Ken Krugler
> http://www.scaleunlimited.com
> custom big data solutions & training
> Hadoop, Cascading, Cassandra & Solr
>
>
Reply | Threaded
Open this post in threaded view
|

Re: More issues with top-level build for Tika 1.25 rc1 - Waited more than 5 minutes for a SAXParser

Tim Allison
with hash

*Apache Maven 3.5.0 (ff8f5e7444045639af65f6095c62210b5713f426;
2017-04-03T15:39:06-04:00)*

Maven home: /Users/allison/Downloads/mvn/apache-maven-3.5.0

Java version: 1.8.0_131, vendor: Oracle Corporation

Java home:
/Library/Java/JavaVirtualMachines/jdk1.8.0_131.jdk/Contents/Home/jre

Default locale: en_US, platform encoding: UTF-8

OS name: "mac os x", version: "10.14.6", arch: "x86_64", family: "mac"

On Mon, Nov 23, 2020 at 5:40 PM Tim Allison <[hidden email]> wrote:

> I'm still having the can't replicate blues...  Again, I completely trust
> your findings...argh...
>
> [*INFO*] *BUILD SUCCESS*
>
> [*INFO*]
> *------------------------------------------------------------------------*
>
> [*INFO*] Total time: 11:49 min
>
> [*INFO*] Finished at: 2020-11-23T17:39:28-05:00
>
> [*INFO*] Final Memory: 630M/3162M
>
>
> Maven home: /Users/allison/Downloads/mvn/apache-maven-3.5.0
>
> Java version: 1.8.0_131, vendor: Oracle Corporation
>
> Java home:
> /Library/Java/JavaVirtualMachines/jdk1.8.0_131.jdk/Contents/Home/jre
>
> Default locale: en_US, platform encoding: UTF-8
>
> OS name: "mac os x", version: "10.14.6", arch: "x86_64", family: "mac"
>
> On Mon, Nov 23, 2020 at 5:04 PM Ken Krugler <[hidden email]>
> wrote:
>
>> Hi Tim,
>>
>> I was able to build 1.24.1 without any problems.
>>
>> I think tried “mvn clean package” on 1.25-rc1, and this time it made it
>> past the parsers, but failed on OSGi.
>>
>> But what’s weird is there wasn’t any error reported for that part of the
>> build:
>>
>> [*INFO*]
>> *------------------------------------------------------------------------*
>> [*INFO*] *Building Apache Tika OSGi bundle 1.25*
>> [*INFO*]
>> *------------------------------------------------------------------------*
>> Downloading:
>> https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.pom
>> Downloaded:
>> https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.pom
>> (1.4 kB at 7.1 kB/s)
>> Downloading:
>> https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.jar
>> Downloaded:
>> https://repo.maven.apache.org/maven2/org/osgi/org.osgi.service.cm/1.6.0/org.osgi.service.cm-1.6.0.jar
>> (55 kB at 951 kB/s)
>> [*INFO*]
>> [*INFO*] *--- *maven-clean-plugin:2.5:clean *(default-clean)* @
>> tika-bundle* ---*
>> [*INFO*] Deleting /Users/kenkrugler/git/tika/tika-bundle/target
>> [*INFO*]
>> [*INFO*] *--- *ossindex-maven-plugin:3.1.0:audit *(audit-dependencies)*
>> @ tika-bundle* ---*
>> [*INFO*] Checking for vulnerabilities; 153 artifacts
>> [*INFO*] Exclude coordinates: [junit:junit:4.13.1]
>> [*INFO*] Exclude vulnerability identifiers: []
>> [*INFO*] CVSS-score threshold: 0.0
>> [*WARNING*] Excluding coordinates: junit:junit:4.13.1
>> [*INFO*]
>> [*INFO*] *--- *maven-enforcer-plugin:3.0.0-M3:enforce *(enforce)* @
>> tika-bundle* ---*
>> [*INFO*]
>> [*INFO*] *--- *maven-remote-resources-plugin:1.5:process *(default)* @
>> tika-bundle* ---*
>> [*INFO*]
>> [*INFO*] *--- *maven-resources-plugin:2.7:resources *(default-resources)*
>> @ tika-bundle* ---*
>> [*INFO*] Using 'UTF-8' encoding to copy filtered resources.
>> [*INFO*] Copying 1 resource
>> [*INFO*] Copying 3 resources
>> [*INFO*]
>> [*INFO*] *--- *maven-compiler-plugin:3.8.0:compile *(default-compile)* @
>> tika-bundle* ---*
>> [*INFO*] No sources to compile
>> [*INFO*]
>> [*INFO*] *--- *maven-resources-plugin:2.7:testResources
>> *(default-testResources)* @ tika-bundle* ---*
>> [*INFO*] Using 'UTF-8' encoding to copy filtered resources.
>> [*INFO*] Copying 4 resources
>> [*INFO*] Copying 3 resources
>> [*INFO*]
>> [*INFO*] *--- *maven-compiler-plugin:3.8.0:testCompile
>> *(default-testCompile)* @ tika-bundle* ---*
>> [*INFO*] Changes detected - recompiling the module!
>> [*INFO*] Compiling 1 source file to
>> /Users/kenkrugler/git/tika/tika-bundle/target/test-classes
>> [*INFO*]
>> /Users/kenkrugler/git/tika/tika-bundle/src/test/java/org/apache/tika/bundle/BundleIT.java:
>> /Users/kenkrugler/git/tika/tika-bundle/src/test/java/org/apache/tika/bundle/BundleIT.java
>> uses or overrides a deprecated API.
>> [*INFO*]
>> /Users/kenkrugler/git/tika/tika-bundle/src/test/java/org/apache/tika/bundle/BundleIT.java:
>> Recompile with -Xlint:deprecation for details.
>> [*INFO*]
>> [*INFO*] *--- *maven-surefire-plugin:3.0.0-M4:test *(default-test)* @
>> tika-bundle* ---*
>> [*INFO*]
>> [*INFO*] *--- *maven-dependency-plugin:2.8:unpack-dependencies
>> *(default)* @ tika-bundle* ---*
>> [*INFO*]
>> [*INFO*] *--- *maven-bundle-plugin:5.1.1:bundle *(default-bundle)* @
>> tika-bundle* ---*
>> [*INFO*]
>> *------------------------------------------------------------------------*
>> [*INFO*] *Reactor Summary:*
>> [*INFO*]
>> [*INFO*] Apache Tika parent ................................. *SUCCESS*
>> [  1.703 s]
>> [*INFO*] Apache Tika core ................................... *SUCCESS*
>> [ 35.949 s]
>> [*INFO*] Apache Tika parsers ................................ *SUCCESS*
>> [04:33 min]
>> [*INFO*] Apache Tika OSGi bundle ............................ *FAILURE*
>> [12:51 min]
>>
>> It hangs for about 10 minutes after printing out the “[INFO] —
>> maven-bundle-plugin:5.1.1:bundle (default-bundle) @ tika-bundle ---” line.
>>
>>  So…any ideas about this? I’d seen some discussion about maven versioning
>> being an issue recently? I’ve got:
>>
>> *Apache Maven 3.5.0 (ff8f5e7444045639af65f6095c62210b5713f426;
>> 2017-04-03T12:39:06-07:00)*
>> Maven home: /Users/kenkrugler/Tools/apache-maven
>> Java version: 1.8.0_131, vendor: Oracle Corporation
>> Java home:
>> /Library/Java/JavaVirtualMachines/jdk1.8.0_131.jdk/Contents/Home/jre
>> Default locale: en_US, platform encoding: UTF-8
>> OS name: "mac os x", version: "10.12.6", arch: "x86_64", family: "mac"
>>
>> Thanks,
>>
>> — Ken
>>
>>
>> On Nov 23, 2020, at 10:46 AM, Tim Allison <[hidden email]> wrote:
>>
>> Ken,
>>  Thank you for finding this and sharing it.  I haven't seen this on my mac
>> or ubuntu...not denying what you are seeing!
>>
>>  Are you able to build 1.24.1 with no problem?  I wonder if your system is
>> using a different SAXParser which is not handled correctly in
>> XMLReaderUtils?  What OS, what version of java?
>>
>>  Thank you, again.
>>
>>      Best,
>>
>>              Tim
>>
>> On Mon, Nov 23, 2020 at 1:40 PM Ken Krugler <[hidden email]>
>> wrote:
>>
>> Hi all,
>>
>> I got past the JCE issue, but now some tests are failing with timeouts.
>>
>> For this test:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>>
>> I get 100s of these warnings:
>>
>> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>>
>> And then:
>>
>> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
>> 318.512 s <<< FAILURE! - in
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>> [ERROR]
>>
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint
>> Time elapsed: 308.223 s  <<< ERROR!
>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
>> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>>        at
>>
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
>> SAXParser; This could indicate that a parser has not correctly released
>> its
>> SAXParser. Please report this to the Tika team: [hidden email]
>> <mailto:[hidden email] <[hidden email]>>
>>        at
>>
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
>> minutes for a SAXParser; This could indicate that a parser has not
>> correctly released its SAXParser. Please report this to the Tika team:
>> [hidden email] <mailto:[hidden email] <[hidden email]>>
>>        at
>>
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>>
>> Similarly, for:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>>
>> Many of these:
>>
>> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>>
>> And then similarly:
>>
>> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
>> 309.375 s <<< FAILURE! - in
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>> [ERROR]
>>
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint
>> Time elapsed: 307.9 s  <<< ERROR!
>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
>> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>>        at
>>
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
>> SAXParser; This could indicate that a parser has not correctly released
>> its
>> SAXParser. Please report this to the Tika team: [hidden email]
>> <mailto:[hidden email] <[hidden email]>>
>>        at
>>
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
>> minutes for a SAXParser; This could indicate that a parser has not
>> correctly released its SAXParser. Please report this to the Tika team:
>> [hidden email] <mailto:[hidden email] <[hidden email]>>
>>        at
>>
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>>
>> And now:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
>> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
>> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed:
>> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
>> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> … and so on…
>>
>> Any suggestions?
>>
>> Thanks!
>>
>> — Ken
>>
>> --------------------------
>> Ken Krugler
>> http://www.scaleunlimited.com
>> custom big data solutions & training
>> Hadoop, Cascading, Cassandra & Solr
>>
>>
>>
>> --------------------------
>> Ken Krugler
>> http://www.scaleunlimited.com
>> custom big data solutions & training
>> Hadoop, Cascading, Cassandra & Solr
>>
>>