[ 
https://issues.apache.org/jira/browse/TIKA-2232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15822253#comment-15822253
 ] 

Tim Allison commented on TIKA-2232:
-----------------------------------

Proposed change if jbig2 is not on the classpath:

PDFParser extractInlineImages adds:
{noformat}
X-TIKA:EXCEPTION:warn : org.apache.pdfbox.filter.MissingImageReaderException: 
Cannot read JBIG2 image: jbig2-imageio is not installed
        at org.apache.pdfbox.filter.Filter.findImageReader(Filter.java:128)
        at org.apache.pdfbox.filter.JBIG2Filter.decode(JBIG2Filter.java:54)
{noformat}
to the metadata of the PDF...

ImageParser checks for JBIG2 in {{try{ Class.forName}} before adding jp2 to 
{{SUPPORTED_TYPES}}.  If jbig2 is not on the cp, then the files are handled by 
the EmptyParser, as they used to be.

> Add JBIG2 image parsing support
> -------------------------------
>
>                 Key: TIKA-2232
>                 URL: https://issues.apache.org/jira/browse/TIKA-2232
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>    Affects Versions: 1.14
>         Environment: Any
>            Reporter: Pascal Essiembre
>            Assignee: Tim Allison
>            Priority: Minor
>             Fix For: 2.0, 1.15
>
>
> If you are interested, I would like to add support for JBIG2 image files 
> (.jb2, or .jbig2).  I have encountered them PDFs.
> I will make a pull-request shortly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to