[ https://issues.apache.org/jira/browse/TIKA-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14158170#comment-14158170 ]
Tim Allison commented on TIKA-1427: ----------------------------------- We're currently iterating through the images once we hit the bottom of the page. I don't have the skill/knowledge/time to do the math to figure out where the images are in relationship to the text. Sorry! If there's example code somewhere, let us know. The single img tag is an issue, tho. Let me take a look. > PDF Images don't appear in structured view > ------------------------------------------ > > Key: TIKA-1427 > URL: https://issues.apache.org/jira/browse/TIKA-1427 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.6 > Reporter: James Baker > Assignee: Tim Allison > Labels: pdf > > When viewing, say, a Word Document, any images appear in the 'structured > view' of the document as <img> tags. The same is not true of PDF documents, > and we lose both the fact that there is an image present, and where it is in > the document. > Some discussion of this issue in the comments of TIKA-1396. -- This message was sent by Atlassian JIRA (v6.3.4#6332)