Ken,
  Thank you for finding this and sharing it.  I haven't seen this on my mac
or ubuntu...not denying what you are seeing!

  Are you able to build 1.24.1 with no problem?  I wonder if your system is
using a different SAXParser which is not handled correctly in
XMLReaderUtils?  What OS, what version of java?

  Thank you, again.

      Best,

              Tim

On Mon, Nov 23, 2020 at 1:40 PM Ken Krugler <kkrugler_li...@transpac.com>
wrote:

> Hi all,
>
> I got past the JCE issue, but now some tests are failing with timeouts.
>
> For this test:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>
> I get 100s of these warnings:
>
> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
>
> And then:
>
> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
> 318.512 s <<< FAILURE! - in
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
> [ERROR]
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint
> Time elapsed: 308.223 s  <<< ERROR!
> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> SAXParser; This could indicate that a parser has not correctly released its
> SAXParser. Please report this to the Tika team: dev@tika.apache.org
> <mailto:dev@tika.apache.org>
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> minutes for a SAXParser; This could indicate that a parser has not
> correctly released its SAXParser. Please report this to the Tika team:
> dev@tika.apache.org <mailto:dev@tika.apache.org>
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>
> Similarly, for:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>
> Many of these:
>
> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
>
> And then similarly:
>
> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
> 309.375 s <<< FAILURE! - in
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
> [ERROR]
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint
> Time elapsed: 307.9 s  <<< ERROR!
> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> SAXParser; This could indicate that a parser has not correctly released its
> SAXParser. Please report this to the Tika team: dev@tika.apache.org
> <mailto:dev@tika.apache.org>
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> minutes for a SAXParser; This could indicate that a parser has not
> correctly released its SAXParser. Please report this to the Tika team:
> dev@tika.apache.org <mailto:dev@tika.apache.org>
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>
> And now:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed:
> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> … and so on…
>
> Any suggestions?
>
> Thanks!
>
> — Ken
>
> --------------------------
> Ken Krugler
> http://www.scaleunlimited.com
> custom big data solutions & training
> Hadoop, Cascading, Cassandra & Solr
>
>

Reply via email to