[
https://issues.apache.org/jira/browse/TIKA-4749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18085936#comment-18085936
]
Hudson commented on TIKA-4749:
------------------------------
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk17 #1401 (See
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk17/1401/])
TIKA-4749 - improve inline handling of metadata only (#2866) (github:
[https://github.com/apache/tika/commit/363378f7fe98de6521c3e1bb74baa622eb472579])
* (edit) tika-core/src/main/java/org/apache/tika/parser/AutoDetectParser.java
* (add) tika-core/src/main/java/org/apache/tika/parser/MetadataOnlyParse.java
* (edit)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pdf-module/src/main/java/org/apache/tika/parser/pdf/image/ImageGraphicsEngine.java
* (edit)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pdf-module/src/test/java/org/apache/tika/parser/pdf/PDFParserTest.java
> Improve inline image handling in PDFs
> -------------------------------------
>
> Key: TIKA-4749
> URL: https://issues.apache.org/jira/browse/TIKA-4749
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Minor
> Attachments: TIKA-4749.zip
>
>
> [~birdya22] reported an exception from tesseract reading an extracted inline
> image from a PDF. We should figure out exactly what's going wrong and fix it.
>
> [~birdya22] if you're able to share the triggering pdf with us, that would
> be helpful...even if offline.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)