[jira] [Resolved] (TIKA-799) ForkParser does not populate metadata object after completing a parse

2012-11-07 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-799. Resolution: Fixed Fix Version/s: 1.3 Assignee: Jukka Zitting Fixed in revision 140659

Build failed in Jenkins: Tika-trunk #938

2012-11-07 Thread Apache Jenkins Server
See Changes: [jukka] TIKA-799: ForkParser does not populate metadata object after completing a parse Get the metadata from the XHTML head -- [...truncated 127 lines...] mojoSucceeded org.apache.mave

Re: Build failed in Jenkins: Tika-trunk #938

2012-11-07 Thread Jukka Zitting
Hi, On Wed, Nov 7, 2012 at 2:10 PM, Apache Jenkins Server wrote: > Nov 7, 2012 1:10:08 PM > hudson.remoting.SynchronousCommandTransport$ReaderThread run > SEVERE: I/O error in channel channel > java.io.StreamCorruptedException That's Jenkins having some trouble. BR, Jukka Zitting

[jira] [Commented] (TIKA-93) OCR support

2012-11-07 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-93?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13492336#comment-13492336 ] Jukka Zitting commented on TIKA-93: --- JavaOCR looks interesting, and it looks like it's also

[jira] [Commented] (TIKA-1017) DefaultHtmlMapper misses some safe elements

2012-11-07 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13492342#comment-13492342 ] Jukka Zitting commented on TIKA-1017: - The idea behind DefaultHtmlMapper is to try to n

[jira] [Resolved] (TIKA-1009) Expose TextDocument in BoilerpipeContentHandler

2012-11-07 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-1009. - Resolution: Fixed Assignee: Jukka Zitting (was: Ken Krugler) Thanks! Patch applied in revis

[jira] [Commented] (TIKA-1012) Add additional fields to MimeType reader

2012-11-07 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13492428#comment-13492428 ] Jukka Zitting commented on TIKA-1012: - Looks good, though it would be better if such cu

[jira] [Created] (TIKA-1019) Document links in Word documents don't leave a placeholder

2012-11-07 Thread Michael McCandless (JIRA)
Michael McCandless created TIKA-1019: Summary: Document links in Word documents don't leave a placeholder Key: TIKA-1019 URL: https://issues.apache.org/jira/browse/TIKA-1019 Project: Tika

[jira] [Assigned] (TIKA-1019) Document links in Word documents don't leave a placeholder

2012-11-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-1019: Assignee: Michael McCandless > Document links in Word documents don't leave a pl

Jenkins build is back to normal : Tika-trunk #939

2012-11-07 Thread Apache Jenkins Server
See

[jira] [Updated] (TIKA-1019) Document links in Word documents don't leave a placeholder

2012-11-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated TIKA-1019: - Attachment: testDocumentLink.doc TIKA-1019.patch Patch w/ test and fix.

[jira] [Updated] (TIKA-953) Tika failed to recognize non-ustar Tar file?

2012-11-07 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting updated TIKA-953: --- Fix Version/s: (was: 1.2) As of COMPRESS-191, Commons Compress can detect this issue for us. Once C

[jira] [Updated] (TIKA-1012) Add additional fields to MimeType reader

2012-11-07 Thread Ryan McKinley (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan McKinley updated TIKA-1012: Attachment: TIKA-1012-MimeMeta.patch This updates the patch to use tika namespace for custom attribu