[jira] [Commented] (TIKA-3224) Stackoverflow with Embedded PDF in DOCX document

2021-07-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384349#comment-17384349 ] Tim Allison commented on TIKA-3224: --- Doh. Sorry. My bad! Thank you! > Stackoverflow

[jira] [Commented] (TIKA-3224) Stackoverflow with Embedded PDF in DOCX document

2021-07-20 Thread David Pilato (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384332#comment-17384332 ] David Pilato commented on TIKA-3224: Oh I was confused. PDFBox 2.0.24 is in Tika 1.27.

[jira] [Commented] (TIKA-3224) Stackoverflow with Embedded PDF in DOCX document

2021-07-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384305#comment-17384305 ] Tim Allison commented on TIKA-3224: --- Sorry, I'm not clear on the question...are you aski

[jira] [Commented] (TIKA-3224) Stackoverflow with Embedded PDF in DOCX document

2021-07-20 Thread David Pilato (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384284#comment-17384284 ] David Pilato commented on TIKA-3224: I just tested Tika 1.27 with PDFBox 2.0.24 and it

[jira] [Commented] (TIKA-3224) Stackoverflow with Embedded PDF in DOCX document

2020-11-08 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17228005#comment-17228005 ] Tilman Hausherr commented on TIKA-3224: --- related issue is resolved > Stackoverflow

[jira] [Commented] (TIKA-3224) Stackoverflow with Embedded PDF in DOCX document

2020-11-04 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17226368#comment-17226368 ] Tim Allison commented on TIKA-3224: --- And, to triple check Tika, I extracted the embedded

[jira] [Commented] (TIKA-3224) Stackoverflow with Embedded PDF in DOCX document

2020-11-04 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17226367#comment-17226367 ] Tim Allison commented on TIKA-3224: --- Oddly (?), I'm not able to extract the file from th

[jira] [Commented] (TIKA-3224) Stackoverflow with Embedded PDF in DOCX document

2020-11-04 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17226366#comment-17226366 ] Tim Allison commented on TIKA-3224: --- I manually extracted the PDF file from the docx, an