David Avant created TIKA-3718:
---------------------------------

             Summary: Special PDF document causes Tika parser to hang
                 Key: TIKA-3718
                 URL: https://issues.apache.org/jira/browse/TIKA-3718
             Project: Tika
          Issue Type: Bug
          Components: app
    Affects Versions: 2.3.0, 1.28.1
         Environment: Reproduced on Windows with Java 8.
However, the problem does not appear to be environment-specific.
            Reporter: David Avant
         Attachments: map.pdf

Attempting to parse the attached "map.pdf" causes the Tika parser to hang due 
to an infinite loop involving "PDFStreamParser" logic.

This problem occurs in both tika-app 1.28.1 and 2.3.0.

It is also worth noting that Acrobat itself becomes unresponsive when it 
attempts to open this document.

To reproduce the problem, just run:

java -jar tika-app-1.28.1.jar map.pdf
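
For a scripted reproduction that does not block forever, the command above can be wrapped in a small launcher that kills the child process after a time limit. This is only a minimal sketch: the class name HangCheck, the helper finishesWithin, and the 60-second limit are illustrative assumptions, not part of Tika. It also assumes "java", the tika-app jar, and map.pdf are reachable from the working directory.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.TimeUnit;

public class HangCheck {

    /**
     * Runs a command as a child process and reports whether it exited
     * within the given time limit. A child that is still running when
     * the limit expires is killed.
     */
    public static boolean finishesWithin(List<String> command, long timeout, TimeUnit unit)
            throws IOException, InterruptedException {
        ProcessBuilder builder = new ProcessBuilder(command);
        // Share this process's stdout/stderr so the child never blocks on a full pipe.
        builder.inheritIO();
        Process process = builder.start();

        boolean finished = process.waitFor(timeout, unit);
        if (!finished) {
            process.destroyForcibly();
            process.waitFor(); // wait for the killed child to be reaped
        }
        return finished;
    }

    public static void main(String[] args) throws Exception {
        // Same command as in the report; the 60-second limit is an arbitrary assumption.
        List<String> command = Arrays.asList("java", "-jar", "tika-app-1.28.1.jar", "map.pdf");
        boolean finished = finishesWithin(command, 60, TimeUnit.SECONDS);
        System.out.println(finished
                ? "parser exited within the limit"
                : "parser still running after 60 s, killed");
    }
}
```

Running the parser in a separate process is used here because a thread stuck in a tight loop inside the same JVM cannot be reliably stopped from outside, whereas a child process can always be killed.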


