[
https://issues.apache.org/jira/browse/TIKA-4749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Adrian Bird updated TIKA-4749:
------------------------------
Attachment: TIKA-4749.zip
> Improve inline image handling in PDFs
> -------------------------------------
>
> Key: TIKA-4749
> URL: https://issues.apache.org/jira/browse/TIKA-4749
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Minor
> Attachments: TIKA-4749.zip
>
>
> [~birdya22] reported an exception from tesseract reading an extracted inline
> image from a PDF. We should figure out exactly what's going wrong and fix it.
>
> [~birdya22] if you're able to share the triggering pdf with us, that would
> be helpful...even if offline.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)