Jing Li created TIKA-953:
Summary: Tika failed to recognize non-ustar Tar file?
Key: TIKA-953
URL: https://issues.apache.org/jira/browse/TIKA-953
Project: Tika
Issue Type: Bug
Components:
[
https://issues.apache.org/jira/browse/TIKA-953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13413609#comment-13413609
]
Nick Burch commented on TIKA-953:
-
Any chance you could share a file that demonstrates the p
Rob Tulloh created TIKA-954:
---
Summary: Tika throws OOM and GC limited exceeded on Microsoft docx file
Key: TIKA-954
URL: https://issues.apache.org/jira/browse/TIKA-954
Project: Tika
Issue Type: Bug
[
https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rob Tulloh updated TIKA-954:
Attachment: Word.docx
The docx file that causes the error.
> Tika throws OOM and GC limited
[
https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13413850#comment-13413850
]
Rob Tulloh commented on TIKA-954:
-
> curl -v -T Word.docx http://localhost:9998/tika
* About
[
https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13413863#comment-13413863
]
Nick Burch commented on TIKA-954:
-
How much memory are you giving to the Tika process? Did y
[
https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13413904#comment-13413904
]
Rob Tulloh commented on TIKA-954:
-
We have been running with 600M. We are now increasing the
[
https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13414076#comment-13414076
]
Rob Tulloh commented on TIKA-954:
-
Turns out we are running on CentOS 5.x so I can test with
[
https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13414118#comment-13414118
]
Rob Tulloh commented on TIKA-954:
-
We can provide you the JVM heap dump if you think that is
[
https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13414296#comment-13414296
]
Rob Tulloh commented on TIKA-954:
-
We bumped the JVM size to 2 GB. We now get an empty reply
[
https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13414297#comment-13414297
]
Rob Tulloh commented on TIKA-954:
-
curl output:
* Connected to localhost (127.0.0.1) port 9