[jira] [Created] (TIKA-953) Tika failed to recognize non-ustar Tar file?

2012-07-13 Thread Jing Li (JIRA)
Jing Li created TIKA-953: Summary: Tika failed to recognize non-ustar Tar file? Key: TIKA-953 URL: https://issues.apache.org/jira/browse/TIKA-953 Project: Tika Issue Type: Bug Components:

[jira] [Commented] (TIKA-953) Tika failed to recognize non-ustar Tar file?

2012-07-13 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13413609#comment-13413609 ] Nick Burch commented on TIKA-953: Any chance you could share a file that demonstrates the p

[jira] [Created] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file

2012-07-13 Thread Rob Tulloh (JIRA)
Rob Tulloh created TIKA-954: Summary: Tika throws OOM and GC limited exceeded on Microsoft docx file Key: TIKA-954 URL: https://issues.apache.org/jira/browse/TIKA-954 Project: Tika Issue Type: Bu

[jira] [Updated] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file

2012-07-13 Thread Rob Tulloh (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rob Tulloh updated TIKA-954: Attachment: Word.docx The docx file that causes the error. > Tika throws OOM and GC limited

[jira] [Commented] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file

2012-07-13 Thread Rob Tulloh (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13413850#comment-13413850 ] Rob Tulloh commented on TIKA-954: > curl -v -T Word.docx http://localhost:9998/tika * About

[jira] [Commented] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file

2012-07-13 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13413863#comment-13413863 ] Nick Burch commented on TIKA-954: How much memory are you giving to the Tika process? Did y

[jira] [Commented] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file

2012-07-13 Thread Rob Tulloh (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13413904#comment-13413904 ] Rob Tulloh commented on TIKA-954: We have been running with 600M. We are now increasing the

[jira] [Commented] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file

2012-07-13 Thread Rob Tulloh (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13414076#comment-13414076 ] Rob Tulloh commented on TIKA-954: Turns out we are running on CentOS 5.x so I can test with

[jira] [Commented] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file

2012-07-13 Thread Rob Tulloh (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13414118#comment-13414118 ] Rob Tulloh commented on TIKA-954: We can provide you the JVM heap dump if you think that is

[jira] [Commented] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file

2012-07-13 Thread Rob Tulloh (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13414296#comment-13414296 ] Rob Tulloh commented on TIKA-954: We bumped the JVM size to 2 GB. We now get an empty reply

[jira] [Commented] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file

2012-07-13 Thread Rob Tulloh (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13414297#comment-13414297 ] Rob Tulloh commented on TIKA-954: curl output: * Connected to localhost (127.0.0.1) port 9