David Avant created TIKA-3718:
---------------------------------
Summary: Special PDF document causes Tika parser to hang
Key: TIKA-3718
URL: https://issues.apache.org/jira/browse/TIKA-3718
Project: Tika
Issue Type: Bug
Components: app
Affects Versions: 2.3.0, 1.28.1
Environment: The problem can be reproduced under (Windows + Java8).
However, the problem does not appear to be environment specific.
Reporter: David Avant
Attachments: map.pdf
Attempting to parse the attached "map.pdf" causes the Tika parser to hang due
to an infinite loop involving "PDFStreamParser" logic.
This problem occurs in both tika-app 1.28.1 and 2.3.0.
It is also worth noting that Acrobat itself will become unresponsive if
attempting to open this document.
To reproduce the problem, just run:
java -jar tika-app-1.28.1.jar map.pdf
--
This message was sent by Atlassian Jira
(v8.20.1#820001)