Change vote: -1 I whittled this down to 3 files in pure POI and/or a standalone project that includes POI. This is a blocker in my mind. After a parser hits one of these errors, all the rest of the ppts fail.
This affects 3.13 final and 3.14-beta1. I don't know why I didn't find this sooner. Argh... Now we can do the git-bisect that Dominik recommended. See: https://bz.apache.org/bugzilla/show_bug.cgi?id=58718 -----Original Message----- From: Allison, Timothy B. [mailto:talli...@mitre.org] Sent: Thursday, December 10, 2015 2:27 PM To: POI Developers List <dev@poi.apache.org> Subject: RE: Vote on 3.14-beta1 +1 I just finished the run against 38k documents. We're getting more attachments from doc files, and ~251 ppt files are no longer throwing exceptions. I did discover a potential multithreading issue in ppts, but I can only reproduce it so far with tika-app in batch mode when I run against files sorted by mime type (all ppts at once). I can reproduce it for 3.13 with the same set up (tika-app, batch mode with a list of files sorted by mime type). I can't reproduce it yet in junit. I'll open an issue on our tracker for that. Cheers, Tim