[jira] [Commented] (TIKA-1419) Upgrade to PDFBox 1.8.7

2014-09-29 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152855#comment-14152855 ] Tilman Hausherr commented on TIKA-1419: --- Compare PDFBox's trunk against 1.8.x periodi

[jira] [Commented] (TIKA-1427) PDF Images don't appear in structured view

2014-09-29 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152794#comment-14152794 ] Hudson commented on TIKA-1427: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #238 (See [https://b

[jira] [Commented] (TIKA-605) Tika GDAL parser

2014-09-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152791#comment-14152791 ] Chris A. Mattmann commented on TIKA-605: OK I'm working on this again. First step:

Tika OCR wiki page

2014-09-29 Thread Mattmann, Chris A (3980)
Hey Guys, I wrote a simple wiki page for Tika OCR. Admit it¹s Mac centric instructions for Tesseract: https://wiki.apache.org/tika/TikaOCR If you have improvements to make to the docs, please do! :) Cheers, Chris ++ Chris Mattman

[jira] [Commented] (TIKA-1427) PDF Images don't appear in structured view

2014-09-29 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152756#comment-14152756 ] Hudson commented on TIKA-1427: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #216 (See [https://b

[jira] [Commented] (TIKA-1433) Extract documents embedded within annotations in PDFs

2014-09-29 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152741#comment-14152741 ] Hudson commented on TIKA-1433: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #237 (See [https://b

[jira] [Resolved] (TIKA-1427) PDF Images don't appear in structured view

2014-09-29 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1427. --- Resolution: Fixed r1628354. Let me know if the markup is sufficient for your needs. > PDF Images don'

[jira] [Commented] (TIKA-1433) Extract documents embedded within annotations in PDFs

2014-09-29 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152720#comment-14152720 ] Hudson commented on TIKA-1433: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #215 (See [https://b

[jira] [Resolved] (TIKA-1414) How to extract embedded images from PDFs?

2014-09-29 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1414. --- Resolution: Not a Problem > How to extract embedded images from PDFs? > ---

[jira] [Resolved] (TIKA-1433) Extract documents embedded within annotations in PDFs

2014-09-29 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1433. --- Resolution: Fixed r1628350 > Extract documents embedded within annotations in PDFs > -

[jira] [Created] (TIKA-1433) Extract documents embedded within annotations in PDFs

2014-09-29 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1433: - Summary: Extract documents embedded within annotations in PDFs Key: TIKA-1433 URL: https://issues.apache.org/jira/browse/TIKA-1433 Project: Tika Issue Type: New Fe

tika-trunk-jdk1.6 - Build # 214 - Failure

2014-09-29 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-trunk-jdk1.6 (build #214) Status: Failure Check console output at https://builds.apache.org/job/tika-trunk-jdk1.6/214/ to view the results.

[jira] [Commented] (TIKA-1420) Add Metadata Extraction to Arbitrary Parsers

2014-09-29 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152634#comment-14152634 ] Hudson commented on TIKA-1420: -- FAILURE: Integrated in tika-trunk-jdk1.6 #214 (See [https://b

[jira] [Commented] (TIKA-1420) Add Metadata Extraction to Arbitrary Parsers

2014-09-29 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152625#comment-14152625 ] Hudson commented on TIKA-1420: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #236 (See [https://b

[jira] [Resolved] (TIKA-1420) Add Metadata Extraction to Arbitrary Parsers

2014-09-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1420. --- Resolution: Fixed Fix Version/s: 1.7 Assignee: Tyler Palsulich Moved over in r1

[jira] [Commented] (TIKA-1432) some docx files creates exception

2014-09-29 Thread Marco Machado (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152096#comment-14152096 ] Marco Machado commented on TIKA-1432: - Apparently, when documents have images, the exce

[jira] [Updated] (TIKA-1432) some docx files creates exception

2014-09-29 Thread Marco Machado (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Machado updated TIKA-1432: Description: using some docx files (attached files) as input throws exception. Trace: Exception in

[jira] [Updated] (TIKA-1432) some docx files creates exception

2014-09-29 Thread Marco Machado (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Machado updated TIKA-1432: Description: using some docx files (attached files) as input results in exception. Trace: Exceptio

[jira] [Updated] (TIKA-1432) some docx files creates exception

2014-09-29 Thread Marco Machado (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Machado updated TIKA-1432: Attachment: ListaQuestoes2014.docx java.docx > some docx files creates exception > --

[jira] [Updated] (TIKA-1432) some docx files creates exception

2014-09-29 Thread Marco Machado (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Machado updated TIKA-1432: Description: using some docx files as input results in exception. Trace: Exception in thread "main

[jira] [Updated] (TIKA-1432) some docx files creates exception

2014-09-29 Thread Marco Machado (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Machado updated TIKA-1432: Component/s: parser Priority: Minor (was: Major) Affects Version/s: 1.6 > som

[jira] [Created] (TIKA-1432) some docx files creates exception

2014-09-29 Thread Marco Machado (JIRA)
Marco Machado created TIKA-1432: --- Summary: some docx files creates exception Key: TIKA-1432 URL: https://issues.apache.org/jira/browse/TIKA-1432 Project: Tika Issue Type: Bug Report

[jira] [Commented] (TIKA-1431) How to extract embedded images in a document?

2014-09-29 Thread Damiano (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14151679#comment-14151679 ] Damiano commented on TIKA-1431: --- [~gagravarr] Thank you for your fast reply. Ok, so i think

[jira] [Commented] (TIKA-1419) Upgrade to PDFBox 1.8.7

2014-09-29 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14151614#comment-14151614 ] Tim Allison commented on TIKA-1419: --- Thank you! Let me know when I should run 1.8.8 v. 1

[jira] [Commented] (TIKA-1431) How to extract embedded images in a document?

2014-09-29 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14151587#comment-14151587 ] Nick Burch commented on TIKA-1431: -- If you go to http://localhost:9998/ you'll see the lis

[jira] [Updated] (TIKA-1431) How to extract embedded images in a document?

2014-09-29 Thread Damiano (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Damiano updated TIKA-1431: -- Environment: *ubuntu 14.04 LTS* {quote} MD A10-5800K APU with Radeon(tm) HD Graphics × 4 {quote} *java version

[jira] [Updated] (TIKA-1431) How to extract embedded images in a document?

2014-09-29 Thread Damiano (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Damiano updated TIKA-1431: -- Description: Hello, I just downloaded Tika Server from here: https://archive.apache.org/dist/tika/tika-server-1

[jira] [Updated] (TIKA-1431) How to extract embedded images in a document?

2014-09-29 Thread Damiano (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Damiano updated TIKA-1431: -- Description: Hello, I just downloaded Tika Server from here: https://archive.apache.org/dist/tika/tika-server-1

[jira] [Updated] (TIKA-1431) How to extract embedded images in a document?

2014-09-29 Thread Damiano (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Damiano updated TIKA-1431: -- Description: Hello, I just downloaded Tika Server from here: https://archive.apache.org/dist/tika/tika-server-1

[jira] [Updated] (TIKA-1431) How to extract embedded images in a document?

2014-09-29 Thread Damiano (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Damiano updated TIKA-1431: -- Description: Hello, I just downloaded Tika Server from here: https://archive.apache.org/dist/tika/tika-server-1

[jira] [Created] (TIKA-1431) How to extract embedded images in a document?

2014-09-29 Thread Damiano (JIRA)
Damiano created TIKA-1431: - Summary: How to extract embedded images in a document? Key: TIKA-1431 URL: https://issues.apache.org/jira/browse/TIKA-1431 Project: Tika Issue Type: Bug Componen