Ken, Thank you for finding this and sharing it. I haven't seen this on my mac or ubuntu...not denying what you are seeing!
Are you able to build 1.24.1 with no problem? I wonder if your system is using a different SAXParser which is not handled correctly in XMLReaderUtils? What OS, what version of java? Thank you, again. Best, Tim On Mon, Nov 23, 2020 at 1:40 PM Ken Krugler <kkrugler_li...@transpac.com> wrote: > Hi all, > > I got past the JCE issue, but now some tests are failing with timeouts. > > For this test: > > [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest > > I get 100s of these warnings: > > Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > > And then: > > [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: > 318.512 s <<< FAILURE! - in > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest > [ERROR] > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint > Time elapsed: 308.223 s <<< ERROR! > org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException > from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) > Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a > SAXParser; This could indicate that a parser has not correctly released its > SAXParser. Please report this to the Tika team: dev@tika.apache.org > <mailto:dev@tika.apache.org> > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) > Caused by: org.apache.tika.exception.TikaException: Waited more than 5 > minutes for a SAXParser; This could indicate that a parser has not > correctly released its SAXParser. Please report this to the Tika team: > dev@tika.apache.org <mailto:dev@tika.apache.org> > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) > > Similarly, for: > > [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest > > Many of these: > > Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > > And then similarly: > > [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: > 309.375 s <<< FAILURE! - in > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest > [ERROR] > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint > Time elapsed: 307.9 s <<< ERROR! > org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException > from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 > at > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) > Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a > SAXParser; This could indicate that a parser has not correctly released its > SAXParser. Please report this to the Tika team: dev@tika.apache.org > <mailto:dev@tika.apache.org> > at > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) > Caused by: org.apache.tika.exception.TikaException: Waited more than 5 > minutes for a SAXParser; This could indicate that a parser has not > correctly released its SAXParser. Please report this to the Tika team: > dev@tika.apache.org <mailto:dev@tika.apache.org> > at > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) > > And now: > > [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest > [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: > 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest > [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest > [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: > 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest > [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest > Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > … and so on… > > Any suggestions? > > Thanks! > > — Ken > > -------------------------- > Ken Krugler > http://www.scaleunlimited.com > custom big data solutions & training > Hadoop, Cascading, Cassandra & Solr > >