[jira] Assigned: (TIKA-515) MimeType.getDescription() often returns nothing when "tika-mimetypes.xml" has a useful description already available.

2010-09-14 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned TIKA-515: -- Assignee: Chris A. Mattmann > MimeType.getDescription() often returns nothing when "tika-

[jira] Updated: (TIKA-515) MimeType.getDescription() often returns nothing when "tika-mimetypes.xml" has a useful description already available.

2010-09-14 Thread Miroslav Pokorny (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Miroslav Pokorny updated TIKA-515: -- Component/s: mime Forgot to set component=mime > MimeType.getDescription() often returns nothing

[jira] Created: (TIKA-515) MimeType.getDescription() often returns nothing when "tika-mimetypes.xml" has a useful description already available.

2010-09-14 Thread Miroslav Pokorny (JIRA)
MimeType.getDescription() often returns nothing when "tika-mimetypes.xml" has a useful description already available. - Key: TIKA-515 URL: https://

[jira] Commented: (TIKA-484) xlsx files created with open office are detected as application/zip

2010-09-14 Thread Victor Kazakov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12909542#action_12909542 ] Victor Kazakov commented on TIKA-484: - I passed the file name to the parser and it was ab

[jira] Resolved: (TIKA-484) xlsx files created with open office are detected as application/zip

2010-09-14 Thread Victor Kazakov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Victor Kazakov resolved TIKA-484. - Resolution: Not A Problem > xlsx files created with open office are detected as application/zip > -

[jira] Resolved: (TIKA-408) Word 6.0/7.0 documents support in office parser

2010-09-14 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-408. - Fix Version/s: 0.8 Resolution: Fixed Sorry, I'd forgotten to add the catch + call of the alternate p

[jira] Updated: (TIKA-506) Improve doc and docx parsing to include more things

2010-09-14 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch updated TIKA-506: Attachment: tika-word6.patch The attached patch improves the parsing of .docx to include headings, hyperlink

[jira] Resolved: (TIKA-514) Provide constructor for AutoDetectParser that has explicit list of supported parsers

2010-09-14 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Krugler resolved TIKA-514. -- Resolution: Fixed Committed: http://svn.apache.org/viewvc/?rev=996984 > Provide constructor for AutoDete

[jira] Updated: (TIKA-514) Provide constructor for AutoDetectParser that has explicit list of supported parsers

2010-09-14 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Krugler updated TIKA-514: - Attachment: TIKA-514.patch > Provide constructor for AutoDetectParser that has explicit list of supported

[jira] Commented: (TIKA-514) Provide constructor for AutoDetectParser that has explicit list of supported parsers

2010-09-14 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12909324#action_12909324 ] Ken Krugler commented on TIKA-514: -- Just to capture all of the permutations, I'd proposed an

[jira] Commented: (TIKA-514) Provide constructor for AutoDetectParser that has explicit list of supported parsers

2010-09-14 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12909322#action_12909322 ] Ken Krugler commented on TIKA-514: -- Another suggestion was to catch exceptions thrown in the

[jira] Commented: (TIKA-514) Provide constructor for AutoDetectParser that has explicit list of supported parsers

2010-09-14 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12909317#action_12909317 ] Ken Krugler commented on TIKA-514: -- As Jukka noted, the CompositeParser class could be clean

[jira] Created: (TIKA-514) Provide constructor for AutoDetectParser that has explicit list of supported parsers

2010-09-14 Thread Ken Krugler (JIRA)
Provide constructor for AutoDetectParser that has explicit list of supported parsers Key: TIKA-514 URL: https://issues.apache.org/jira/browse/TIKA-514 Project: Tika

[jira] Commented: (TIKA-408) Word 6.0/7.0 documents support in office parser

2010-09-14 Thread Adam Wilmer (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12909268#action_12909268 ] Adam Wilmer commented on TIKA-408: -- I see POI 3.7-beta2 with this change is released and the

[jira] Created: (TIKA-513) Support of Deja Vu (DjVu) format

2010-09-14 Thread Oleg Tikhonov (JIRA)
Support of Deja Vu (DjVu) format Key: TIKA-513 URL: https://issues.apache.org/jira/browse/TIKA-513 Project: Tika Issue Type: New Feature Components: parser Reporter: Oleg Tikhonov It m