Re: Possible re-opening of resolved issue TIKA-738?

2011-11-26 Thread Mattmann, Chris A (388J)
Thanks, Mike, appreciate it. Cheers, Chris On Nov 26, 2011, at 11:53 AM, Michael McCandless wrote: > Hi Chris, > > I agree, makes sense. > > I'll go re-close TIKA-738, and commit my patch / CHANGES entry under > the already opened TIKA-778... > > Mike McCandless > > http://blog.mikemccandles

[jira] [Commented] (TIKA-723) Rotated text isn't extracted correctly from PDFs

2011-11-26 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13157602#comment-13157602 ] Michael McCandless commented on TIKA-723: - The sortByPosition option is tricky to de

[jira] [Resolved] (TIKA-778) NullPointerException in tika-app, parsing PDF content

2011-11-26 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-778. - Resolution: Fixed Fix Version/s: 1.1 > NullPointerException in tika-app, parsing

[jira] [Assigned] (TIKA-778) NullPointerException in tika-app, parsing PDF content

2011-11-26 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-778: --- Assignee: Michael McCandless > NullPointerException in tika-app, parsing PDF conten

[jira] [Resolved] (TIKA-738) Tika fails to extract text from PDF annotations

2011-11-26 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-738. - Resolution: Fixed Per discussion on tika-dev I'll leave this issue closed, and commit this

Re: Possible re-opening of resolved issue TIKA-738?

2011-11-26 Thread Michael McCandless
Hi Chris, I agree, makes sense. I'll go re-close TIKA-738, and commit my patch / CHANGES entry under the already opened TIKA-778... Mike McCandless http://blog.mikemccandless.com On Sat, Nov 26, 2011 at 1:54 PM, Mattmann, Chris A (388J) wrote: > Hey Guys, > > Just an FYI my personal preferenc

[jira] [Commented] (TIKA-513) Support of Deja Vu (DjVu) format

2011-11-26 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13157597#comment-13157597 ] Nick Burch commented on TIKA-513: - Both DjVuLibre and JavaDjVu are GPL'd, so we couldn't hos

Re: Possible re-opening of resolved issue TIKA-738?

2011-11-26 Thread Mattmann, Chris A (388J)
Hey Guys, Just an FYI, my personal preference on things like this is to leave the original issue closed, open up a new issue and to link back to the original one. This is mainly from a release management perspective, where we may have already shipped a CHANGES.txt with a closed issue that gets re

[jira] [Commented] (TIKA-723) Rotated text isn't extracted correctly from PDFs

2011-11-26 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13157581#comment-13157581 ] Nick Burch commented on TIKA-723: - I think the idea is to offer these sort of PDFBox options

[jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

2011-11-26 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13157558#comment-13157558 ] Nick Burch commented on TIKA-792: - Your quick fix is to replace the poi-ooxml-schemas jar wi

[jira] [Created] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

2011-11-26 Thread Torsten Krah (Created) (JIRA)
NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document Key: TIKA-792 URL: https://issues.apache.or

[jira] [Updated] (TIKA-738) Tika fails to extract text from PDF annotations

2011-11-26 Thread Michael McCandless (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated TIKA-738: Attachment: TIKA-738.patch Patch, fixing the excess tag. > Tika fails to ex

[jira] [Reopened] (TIKA-738) Tika fails to extract text from PDF annotations

2011-11-26 Thread Michael McCandless (Reopened) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reopened TIKA-738: - Reopening per the discussion on tika-dev; it looks like this fix also caused the NPE in TIKA-7

Re: Possible re-opening of resolved issue TIKA-738?

2011-11-26 Thread Michael McCandless
Yes please go ahead and reopen TIKA-738... sounds like something is wrong! Thanks. Mike McCandless http://blog.mikemccandless.com On Fri, Nov 25, 2011 at 9:25 PM, John M wrote: > Hello, > > When I use the latest build of the Tika application jar's CLI with the > -h option to parse testAnnotati