Hong-Thai Nguyen created TIKA-1223:
--------------------------------------

             Summary: Extract thumbnail of OOXML Office files
                 Key: TIKA-1223
                 URL: https://issues.apache.org/jira/browse/TIKA-1223
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 1.4
            Reporter: Hong-Thai Nguyen
            Priority: Minor
             Fix For: 1.5


>From Microsoft Office 2007 file formats, thumbnail could be included in 
>package. We can extract this embedded thumbnail for OOXML files.

As discussed in mailing list, we should extract thumbnail as a attachment, not 
as metadata (TIKA-90).

embeddedRelationId format is thumbnail_{i}.{extension}.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to