Custom metadata from more formats
---------------------------------
Key: TIKA-652
URL: https://issues.apache.org/jira/browse/TIKA-652
Project: Tika
Issue Type: Improvement
Components: parser
Affects Versions: 0.9
Reporter: Nick Burch
Assignee: Nick Burch

Currently, Tika handles custom metadata from OpenDocument files. Each custom
metadata entry is returned with a custom: prefix (see
OpenOfficeParserTest#testOO2Metadata for an example).

The parsers for the Microsoft file formats and for PDF do not include custom
metadata in their output.

Assuming we're happy to include custom document metadata in the parsing step,
with the custom: prefix, I'll go ahead and add it to the Microsoft (OLE2 and
OOXML) and PDF parsers.
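
To make the prefix convention concrete, here is a minimal, self-contained
sketch. It is not Tika code: the class CustomMetadataSketch, the method
withCustomPrefix and the sample property names are all hypothetical, and plain
maps stand in for Tika's metadata object. Only the custom: prefix itself comes
from the description above.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration only; none of these names exist in Tika.
public class CustomMetadataSketch {

    // The prefix named in this issue.
    static final String CUSTOM_PREFIX = "custom:";

    // Returns a copy of the standard metadata with every user-defined
    // property added under a "custom:"-prefixed key.
    static Map<String, String> withCustomPrefix(Map<String, String> standard,
                                                Map<String, String> custom) {
        Map<String, String> merged = new LinkedHashMap<>(standard);
        for (Map.Entry<String, String> entry : custom.entrySet()) {
            merged.put(CUSTOM_PREFIX + entry.getKey(), entry.getValue());
        }
        return merged;
    }

    public static void main(String[] args) {
        Map<String, String> standard = new LinkedHashMap<>();
        standard.put("title", "Quarterly report");

        // Assumed sample data: user-defined properties read from a document.
        Map<String, String> custom = new LinkedHashMap<>();
        custom.put("title", "Internal draft"); // same name as a standard key
        custom.put("Reviewer", "alice");

        Map<String, String> merged = withCustomPrefix(standard, custom);
        merged.forEach((key, value) -> System.out.println(key + " = " + value));
    }
}
```

One consequence visible in the sketch is that a custom property sharing a name
with a standard key (title here) does not overwrite it, since the two end up
under different keys.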