[ https://issues.apache.org/jira/browse/TIKA-1415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14134369#comment-14134369 ]
Nick Burch commented on TIKA-1415: ---------------------------------- We have unit tests which show Tika (trunk) successfully detecting and extracting embedded resources (including word documents) from within a PowerPoint .ppt file Any chance you could write a small junit unit test showing your problem? And including a sample powerpoint file if you can't reproduce the issue on the Tika test PPT files. > PowerPoint2003 embedded with word. The embedded file can not be detected. > ------------------------------------------------------------------------- > > Key: TIKA-1415 > URL: https://issues.apache.org/jira/browse/TIKA-1415 > Project: Tika > Issue Type: Bug > Components: detector, parser > Affects Versions: 1.5 > Environment: window7 > Reporter: sunxingzhe > Labels: Tika, poi > > Word2003 or word2007 insert into Powerpoint2003 as embedded file。 > The embedded file‘s type can not be detected。 > The embedded file's content can not be parsed. -- This message was sent by Atlassian JIRA (v6.3.4#6332)