[ https://issues.apache.org/jira/browse/TIKA-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14165592#comment-14165592 ]
Tim Allison commented on TIKA-1427: ----------------------------------- Hmmm...I'm not able to grab the wmf embedded image from the resources.getXObjects(). I also can't export the file with PDFBox's ExtractImages. [~tilman], is this a new feature request for PDFBox (extract inline wmf files), or am I missing the way to do it with the current PDFBox API? Thank you! > PDF Images don't appear in structured view > ------------------------------------------ > > Key: TIKA-1427 > URL: https://issues.apache.org/jira/browse/TIKA-1427 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.6 > Reporter: James Baker > Assignee: Tim Allison > Labels: pdf > Attachments: images_test.pdf > > > When viewing, say, a Word Document, any images appear in the 'structured > view' of the document as <img> tags. The same is not true of PDF documents, > and we lose both the fact that there is an image present, and where it is in > the document. > Some discussion of this issue in the comments of TIKA-1396. -- This message was sent by Atlassian JIRA (v6.3.4#6332)