[ 
https://issues.apache.org/jira/browse/TIKA-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14086189#comment-14086189
 ] 

Tim Allison commented on TIKA-1380:
-----------------------------------

Something along these lines:
{noformat}
if (type == POIFSDocumentType.OLE10_NATIVE) {
                try {
                    // Try to un-wrap the OLE10Native record:
                    Ole10Native ole = 
Ole10Native.createFromEmbeddedOleObject((DirectoryNode)dir);
                    String label = ole.getLabel();

                    if (label == null){
                        label = "unknown_"+unknownFileNum++;
                    }
                    metadata.set(Metadata.RESOURCE_NAME_KEY, dir.getName() + 
'/' + label);
{noformat}

yields a RESOURCE_NAME_KEY of:
MBD002B0FA6/unknown_0

Or, if label is null, do we just want to use the directory name?  Perhaps with 
an unknownFileNum appended to prevent clashes?

> Upgrade to Apache POI 3.11 beta 1
> ---------------------------------
>
>                 Key: TIKA-1380
>                 URL: https://issues.apache.org/jira/browse/TIKA-1380
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.6
>            Reporter: Nick Burch
>             Fix For: 1.6, 1.7
>
>         Attachments: TIKA-1380.patch, TIKA-1380b.patch, TIKA-1380c.patch, 
> tika-commentstable-missing.diff
>
>
> All being well, in a week there'll be a new release of Apache POI available, 
> 3.11 beta 1
> This issue is to track the upgrade, any required changes, and fixing any 
> TODOs that this upgrade permits



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to