Thank you, Javen. I worry that I'll be adding duct tape to OPCPackage, but let me put together a patch and we can decide if adding an InlinePackage is too Frankenstein-y for POI. -----Original Message----- From: Javen O'Neal [mailto:javenon...@gmail.com] Sent: Thursday, November 17, 2016 5:58 PM To: POI Developers List <dev@poi.apache.org> Subject: Re: 2006 ML format?
This would probably be of interest to users of POI who are not necessarily using Tika. If someone spends the effort to add support for a Microsoft Office format, POI seems like a better host. On Nov 17, 2016 10:55 AM, "Allison, Timothy B." <talli...@mitre.org> wrote: All, On TIKA-2179 [1], Sean Story submitted a document that appears to be a 2006 ML format .xml file. It appears to inline the components of a regular docx into a single xml file, no zip. Is it worth the effort to build a read-only subclass of OPCPackage (say, InlinePackage) that would parallel our ZipPackage? Or would it be better to handle this purely on the Tika side and rewrite the file as a temporary ZipFile that can be read by our current OPCPackage? Thank you. Best, Tim [1] https://issues.apache.org/jira/browse/TIKA-2179