Hi Arvid (and the dev list), Sorry - I was busy working on two open source projects at the same time, and messed up. The Tika dev list and the Flink dev list both show up as “dev” in Apple Email :(
This also explains why I didn’t get a response from my fellow Tika devs... — Ken > On Nov 23, 2020, at 4:34 AM, Arvid Heise <ar...@ververica.com> wrote: > > Hi Ken, > > just to double check, did you intend to send this mail to the tika dev > list? I actually don't know what to do with your email. > > Best, > > Arvid > > On Sat, Nov 21, 2020 at 11:43 PM Ken Krugler <kkrugler_li...@transpac.com> > wrote: > >> Hi all, >> >> I got past the JCE issue, but now some tests are failing with timeouts. >> >> For this test: >> >> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest >> >> I get 100s of these warnings: >> >> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils >> acquireSAXParser >> WARNING: Contention waiting for a SAXParser. Consider increasing the >> XMLReaderUtils.POOL_SIZE >> >> And then: >> >> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: >> 318.512 s <<< FAILURE! - in >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest >> [ERROR] >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint >> Time elapsed: 308.223 s <<< ERROR! >> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException >> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 >> at >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) >> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a >> SAXParser; This could indicate that a parser has not correctly released its >> SAXParser. Please report this to the Tika team: d...@tika.apache.org >> at >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) >> Caused by: org.apache.tika.exception.TikaException: Waited more than 5 >> minutes for a SAXParser; This could indicate that a parser has not >> correctly released its SAXParser. Please report this to the Tika team: >> d...@tika.apache.org >> at >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) >> >> Similarly, for: >> >> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest >> >> Many of these: >> >> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils >> acquireSAXParser >> WARNING: Contention waiting for a SAXParser. Consider increasing the >> XMLReaderUtils.POOL_SIZE >> >> And then similarly: >> >> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: >> 309.375 s <<< FAILURE! - in >> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest >> [ERROR] >> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint >> Time elapsed: 307.9 s <<< ERROR! >> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException >> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 >> at >> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) >> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a >> SAXParser; This could indicate that a parser has not correctly released its >> SAXParser. Please report this to the Tika team: d...@tika.apache.org >> at >> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) >> Caused by: org.apache.tika.exception.TikaException: Waited more than 5 >> minutes for a SAXParser; This could indicate that a parser has not >> correctly released its SAXParser. Please report this to the Tika team: >> d...@tika.apache.org >> at >> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) >> >> And now: >> >> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest >> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: >> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest >> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest >> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: >> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest >> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest >> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils >> acquireSAXParser >> WARNING: Contention waiting for a SAXParser. Consider increasing the >> XMLReaderUtils.POOL_SIZE >> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils >> acquireSAXParser >> WARNING: Contention waiting for a SAXParser. Consider increasing the >> XMLReaderUtils.POOL_SIZE >> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils >> acquireSAXParser >> WARNING: Contention waiting for a SAXParser. Consider increasing the >> XMLReaderUtils.POOL_SIZE >> … and so on… >> >> Any suggestions? >> >> Thanks! >> >> — Ken -------------------------- Ken Krugler http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr