[jira] [Commented] (TIKA-1132) Parsing some XLS documents hangs entire JVM, requires kill -9

2013-06-11 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13680800#comment-13680800 ] Nick Burch commented on TIKA-1132: -- Thanks for the test file. There's an open bug in poi a

[jira] [Updated] (TIKA-1132) Parsing some XLS documents hangs entire JVM, requires kill -9

2013-06-11 Thread Ryan Krueger (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Krueger updated TIKA-1132: --- Attachment: mod3.xlsx This trivial file triggers the error. > Parsing some XLS docume

[jira] [Commented] (TIKA-1132) Parsing some XLS documents hangs entire JVM, requires kill -9

2013-06-11 Thread Ryan Krueger (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13680775#comment-13680775 ] Ryan Krueger commented on TIKA-1132: I saved the xls file as a new xlsx file, no change

[jira] [Commented] (TIKA-1132) Parsing some XLS documents hangs entire JVM, requires kill -9

2013-06-11 Thread Ryan Krueger (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13680747#comment-13680747 ] Ryan Krueger commented on TIKA-1132: Running jvisualvm and pulling a thread dump I get

[jira] [Created] (TIKA-1135) Incorrect Cardinality and Case in IPTC Metadata Definition

2013-06-11 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-1135: -- Summary: Incorrect Cardinality and Case in IPTC Metadata Definition Key: TIKA-1135 URL: https://issues.apache.org/jira/browse/TIKA-1135 Project: Tika Issue Type:

[jira] [Resolved] (TIKA-1135) Incorrect Cardinality and Case in IPTC Metadata Definition

2013-06-11 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II resolved TIKA-1135. Resolution: Fixed Resolved in r1491935. > Incorrect Cardinality and Case in IPTC Me

[jira] [Updated] (TIKA-1134) ContentHandler gets ignorable whitespace for tags when parsing HTML

2013-06-11 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated TIKA-1134: --- Attachment: (was: SOLR-4679__weird_TIKA-1134.patch) > ContentHandler gets ignorable whitespace for ta

[jira] [Comment Edited] (TIKA-1134) ContentHandler gets ignorable whitespace for tags when parsing HTML

2013-06-11 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13680544#comment-13680544 ] Hoss Man edited comment on TIKA-1134 at 6/11/13 7:00 PM: - -patch in

[jira] [Updated] (TIKA-1134) ContentHandler gets ignorable whitespace for tags when parsing HTML

2013-06-11 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated TIKA-1134: --- Attachment: SOLR-4679__weird_TIKA-1134.patch patch includes a test demonstrating the problem in Solr, and an e

[jira] [Updated] (TIKA-1134) ContentHandler gets ignorable whitespace for tags when parsing HTML

2013-06-11 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated TIKA-1134: --- Attachment: TIKA-1134.patch FWIW: Changing the XHTMLContentHandler.newline() function to delegate to characte

[jira] [Created] (TIKA-1134) ContentHandler gets ignorable whitespace for tags when parsing HTML

2013-06-11 Thread Hoss Man (JIRA)
Hoss Man created TIKA-1134: -- Summary: ContentHandler gets ignorable whitespace for tags when parsing HTML Key: TIKA-1134 URL: https://issues.apache.org/jira/browse/TIKA-1134 Project: Tika Issue Ty