[ 
https://issues.apache.org/jira/browse/TIKA-4395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17938223#comment-17938223
 ] 

james commented on TIKA-4395:
-----------------------------

[~tallison] yeah, i could share the file with you directly.  i can see your 
consulting email on linkedin, should i send it there?

a) no exceptions

b) no difference with/without streaming

c) i get speaker notes from slides that have them, and there are some images 
getting ocr'ed.  it looks like there is an "output" for each slide, just no 
content

d) calling Tika directly in our application using pretty much defaults for 
everything

e) have not tried the 3.x branch.  i can see if i can give that a go

> cannot get any slide content for pptx file
> ------------------------------------------
>
>                 Key: TIKA-4395
>                 URL: https://issues.apache.org/jira/browse/TIKA-4395
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 2.9.3
>            Reporter: james
>            Priority: Major
>
> i have a reasonably large pptx file from which i don't get any slide content. 
>  i get slide notes, and some ocr from embedded images, but not the slide 
> content itself.  unfortunately, i cannot share the file, but i can answer 
> questions about it if necessary (and can probably share some of the internal 
> structure related files). 
>  
> using poi 5.4.0, not in streaming mode.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to