[ 
https://issues.apache.org/jira/browse/TIKA-4019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17718337#comment-17718337
 ] 

Tim Allison edited comment on TIKA-4019 at 5/1/23 8:10 PM:
-----------------------------------------------------------

Got it.  Thank you.  What OS are you using and what version of tesseract are 
you running? {{tesseract --version}}

I didn't get any exceptions (I also didn't get any ocr'd text).


was (Author: [email protected]):
Got it.  Thank you.  What version of tesseract are you running? {{tesseract 
--version}}

I didn't get any exceptions (I also didn't get any ocr'd text).

> Animated gif embedded in msg email triggers gifToPix exception
> --------------------------------------------------------------
>
>                 Key: TIKA-4019
>                 URL: https://issues.apache.org/jira/browse/TIKA-4019
>             Project: Tika
>          Issue Type: Bug
>         Environment: Docker image "apache/tika:latest-full"
>            Reporter: adjenks
>            Priority: Minor
>
> I get many of these errors:
> {quote}org.apache.tika.exception.TikaException: TesseractOCRParser bad exit 
> value 1 err msg: Error in gifToPix: failed to read GIF data
> Error in pixReadStreamGif: failed to read gif from file data
> Error in pixReadStream: gif: no pix returned
> Error in pixRead: pix not read
> Error during processing.
> at 
> org.apache.tika.parser.ocr.TesseractOCRParser.runOCRProcess(TesseractOCRParser.java:458)
> at 
> org.apache.tika.parser.ocr.TesseractOCRParser.doOCR(TesseractOCRParser.java:412)
> ...etc...
> {quote}
> The common theme among all of the files producing these errors is that they 
> are Outlook msg files with embedded animated gifs.
> I looked at opening a ticket for Tesseract but I recall their site said 
> something like please open a ticket with the software that uses Tesseract 
> first in case it's a configuration problem or something.
> Any which way, I think animated gifs should just be ignored. I get hundreds 
> of these errors.
> Thank you. Good luck.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to