[ https://issues.apache.org/jira/browse/TIKA-2675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16534958#comment-16534958 ]
Hudson commented on TIKA-2675:
------------------------------

UNSTABLE: Integrated in Jenkins build tika-2.x-windows #281 (See [https://builds.apache.org/job/tika-2.x-windows/281/])
TIKA-2675 -- OpenDocumentParser should fail on invalid zip via Sebastian (tallison: rev c9a81a400ee10e9342bbfe718d62f0b0d6c7944f)
* (add) tika-parsers/src/test/resources/test-documents/testODTnotaZipFile.odt
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/odf/OpenDocumentParser.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/odf/ODFParserTest.java


> OpenDocumentParser should fail on invalid zip files
> ---------------------------------------------------
>
>                 Key: TIKA-2675
>                 URL: https://issues.apache.org/jira/browse/TIKA-2675
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.18
>            Reporter: Sebastian Nagel
>            Assignee: Tim Allison
>            Priority: Major
>             Fix For: 1.19, 2.0.0
>
>
> The OpenDocumentParser assumes a zip file as container. However, if it is
> called on an invalid zip stream from a remote URL (see NUTCH-2603), the
> parser signals success and returns a document with no/empty content. The
> behavior is different when called on a local file: while the [constructor of
> ZipFile|https://docs.oracle.com/javase/8/docs/api/java/util/zip/ZipFile.html#ZipFile-java.io.File-]
> fails on invalid input, the [constructor of
> ZipInputStream|https://docs.oracle.com/javase/8/docs/api/java/util/zip/ZipInputStream.html#ZipInputStream-java.io.InputStream-]
> silently ignores the input.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
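
For readers following the issue, the difference between the two constructors can be reproduced with the JDK alone. The sketch below is illustrative only and is not the code from the commit: the class `ZipValidationSketch` and its methods `zipFileRejects`, `countStreamEntries`, and `requireAtLeastOneEntry` are hypothetical names, and the "at least one entry" guard is just one possible way a stream-based caller might detect a non-zip input.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.zip.ZipException;
import java.util.zip.ZipFile;
import java.util.zip.ZipInputStream;

// Hypothetical demo class; not part of Tika.
class ZipValidationSketch {

    /** Writes the bytes to a temp file and reports whether the ZipFile constructor rejects it. */
    static boolean zipFileRejects(byte[] data) throws IOException {
        Path tmp = Files.createTempFile("not-a-zip", ".odt");
        try {
            Files.write(tmp, data);
            try (ZipFile zf = new ZipFile(tmp.toFile())) {
                return false; // constructor accepted the file
            } catch (ZipException e) {
                return true;  // constructor failed on invalid input
            }
        } finally {
            Files.deleteIfExists(tmp);
        }
    }

    /** Counts the entries a ZipInputStream yields; the constructor itself does not validate. */
    static int countStreamEntries(byte[] data) throws IOException {
        int entries = 0;
        try (ZipInputStream zis = new ZipInputStream(new ByteArrayInputStream(data))) {
            while (zis.getNextEntry() != null) {
                entries++;
            }
        }
        return entries;
    }

    /** Assumed guard (not taken from the Tika fix): treat a stream with no entries as an error. */
    static void requireAtLeastOneEntry(byte[] data) throws IOException {
        if (countStreamEntries(data) == 0) {
            throw new IOException("No zip entries found; input is probably not a zip container");
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] notAZip = "This is plain text, not a zip container of any kind."
                .getBytes(StandardCharsets.US_ASCII);

        System.out.println("ZipFile rejects input: " + zipFileRejects(notAZip));
        System.out.println("ZipInputStream entries: " + countStreamEntries(notAZip));

        try {
            requireAtLeastOneEntry(notAZip);
        } catch (IOException e) {
            System.out.println("Guard failed as intended: " + e.getMessage());
        }
    }
}
```

With non-zip input, `ZipFile` throws a `ZipException` from its constructor, while `ZipInputStream` constructs without complaint and its first `getNextEntry()` call returns `null`, which is why a stream-based parser can appear to succeed with empty content unless it checks explicitly.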