Tim Allison created TIKA-4749:
---------------------------------

             Summary: Improve inline image handling in PDFs
                 Key: TIKA-4749
                 URL: https://issues.apache.org/jira/browse/TIKA-4749
             Project: Tika
          Issue Type: Task
            Reporter: Tim Allison


[~birdya22] reported an exception from Tesseract when reading an inline 
image extracted from a PDF. We should figure out exactly what's going wrong and fix it.

[~birdya22], if you're able to share the triggering PDF with us, even 
offline, that would be helpful.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
