Hello,

My ODT (Open Office) files attachments are detected like "zip bomb" files by 
tika and i think theses file are not proprely indexed by solr...
Do you know a way to exclude, odt files from tika' zip bomb check?
Thxs.

Here my log for odt files:
2016-11-14 11:33:04,474 [XWiki Solr index thread] ERROR 
.DocumentSolrMetadataExtractor - Failed to retrieve the content of attachment 
[Attachment xwiki:MyFile.odt] 
org.apache.tika.exception.TikaException: Zip bomb detected!
        at 
org.apache.tika.sax.SecureContentHandler.throwIfCauseOf(SecureContentHandler.java:192)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:123)
        at org.apache.tika.Tika.parseToString(Tika.java:527)
...
        at 
org.xwiki.search.solr.internal.DefaultSolrIndexer.run(DefaultSolrIndexer.java:377)
        at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.tika.sax.SecureContentHandler$SecureSAXException: 
Suspected zip bomb: 100 levels of XML element nesting
        at 
org.apache.tika.sax.SecureContentHandler.startElement(SecureContentHandler.java:234)
...
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
        ... 11 common frames omitted


Pascal B
_______________________________________________
users mailing list
users@xwiki.org
http://lists.xwiki.org/mailman/listinfo/users

Reply via email to