Hong-Thai Nguyen created TIKA-1223: -------------------------------------- Summary: Extract thumbnail of OOXML Office files Key: TIKA-1223 URL: https://issues.apache.org/jira/browse/TIKA-1223 Project: Tika Issue Type: Improvement Components: parser Affects Versions: 1.4 Reporter: Hong-Thai Nguyen Priority: Minor Fix For: 1.5
>From Microsoft Office 2007 file formats, thumbnail could be included in >package. We can extract this embedded thumbnail for OOXML files. As discussed in mailing list, we should extract thumbnail as a attachment, not as metadata (TIKA-90). embeddedRelationId format is thumbnail_{i}.{extension}. -- This message was sent by Atlassian JIRA (v6.1.5#6160)