[ 
https://issues.apache.org/jira/browse/TIKA-4749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18085936#comment-18085936
 ] 

Hudson commented on TIKA-4749:
------------------------------

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk17 #1401 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk17/1401/])
TIKA-4749 - improve inline handling of metadata only (#2866) (github: 
[https://github.com/apache/tika/commit/363378f7fe98de6521c3e1bb74baa622eb472579])
* (edit) tika-core/src/main/java/org/apache/tika/parser/AutoDetectParser.java
* (add) tika-core/src/main/java/org/apache/tika/parser/MetadataOnlyParse.java
* (edit) tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pdf-module/src/main/java/org/apache/tika/parser/pdf/image/ImageGraphicsEngine.java
* (edit) tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pdf-module/src/test/java/org/apache/tika/parser/pdf/PDFParserTest.java


> Improve inline image handling in PDFs
> -------------------------------------
>
>                 Key: TIKA-4749
>                 URL: https://issues.apache.org/jira/browse/TIKA-4749
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Minor
>         Attachments: TIKA-4749.zip
>
>
> [~birdya22] reported an exception from Tesseract reading an extracted inline 
> image from a PDF. We should figure out exactly what's going wrong and fix it.
>
> [~birdya22], if you're able to share the triggering PDF with us, that would 
> be helpful, even if offline.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
