[ 
https://issues.apache.org/jira/browse/TIKA-1415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14134369#comment-14134369
 ] 

Nick Burch commented on TIKA-1415:
----------------------------------

We have unit tests which show Tika (trunk) successfully detecting and 
extracting embedded resources (including Word documents) from within a 
PowerPoint .ppt file.

Any chance you could write a small JUnit test showing your problem? Please 
also include a sample PowerPoint file if you can't reproduce the issue with 
the Tika test PPT files.
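
As a starting point, here is a minimal sketch of the kind of check such a test 
could build on. It assumes tika-core and tika-parsers are on the classpath. 
The names EmbeddedTypeLister, RecordingExtractor and listEmbeddedTypes are 
made up for illustration and are not part of Tika. It registers its own 
EmbeddedDocumentExtractor in the ParseContext and records the media type that 
the default detector reports for each embedded resource.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

import org.apache.tika.detect.DefaultDetector;
import org.apache.tika.detect.Detector;
import org.apache.tika.extractor.EmbeddedDocumentExtractor;
import org.apache.tika.io.TikaInputStream;
import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.AutoDetectParser;
import org.apache.tika.parser.ParseContext;
import org.xml.sax.ContentHandler;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper class, not part of Tika.
public class EmbeddedTypeLister {

    /** Records the detected media type of every embedded resource it is handed. */
    static class RecordingExtractor implements EmbeddedDocumentExtractor {
        final List<String> types = new ArrayList<>();
        private final Detector detector = new DefaultDetector();

        @Override
        public boolean shouldParseEmbedded(Metadata metadata) {
            return true; // look at every embedded resource
        }

        @Override
        public void parseEmbedded(InputStream stream, ContentHandler handler,
                                  Metadata metadata, boolean outputHtml)
                throws IOException {
            // Deliberately not closed: the parent parser owns the underlying stream.
            TikaInputStream tis = TikaInputStream.get(stream);
            types.add(detector.detect(tis, metadata).toString());
        }
    }

    /** Parses the file and returns the media types of its embedded resources. */
    static List<String> listEmbeddedTypes(Path file) throws Exception {
        RecordingExtractor recorder = new RecordingExtractor();
        ParseContext context = new ParseContext();
        context.set(EmbeddedDocumentExtractor.class, recorder);
        try (InputStream in = Files.newInputStream(file)) {
            // The extracted text is discarded; only the embedded resources matter here.
            new AutoDetectParser().parse(in, new DefaultHandler(), new Metadata(), context);
        }
        return recorder.types;
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: EmbeddedTypeLister <file>");
            return;
        }
        for (String type : listEmbeddedTypes(Paths.get(args[0]))) {
            System.out.println(type);
        }
    }
}
```

A JUnit test could call listEmbeddedTypes on the problem file and assert that 
the returned list contains the expected Word media type. If the list comes 
back empty or holds only generic types, that would help narrow down the report.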

> PowerPoint2003 embedded with word. The embedded file can not be detected.
> -------------------------------------------------------------------------
>
>                 Key: TIKA-1415
>                 URL: https://issues.apache.org/jira/browse/TIKA-1415
>             Project: Tika
>          Issue Type: Bug
>          Components: detector, parser
>    Affects Versions: 1.5
>         Environment: window7
>            Reporter: sunxingzhe
>              Labels: Tika, poi
>
> A Word 2003 or Word 2007 document is inserted into a PowerPoint 2003 file as
> an embedded file.
> The embedded file's type cannot be detected.
> The embedded file's content cannot be parsed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
