[jira] [Commented] (TIKA-857) Tika TrueTypeParser add metadata from Naming tables

2012-02-07 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202962#comment-13202962 ] Nick Burch commented on TIKA-857: - Looking at the patch, my only comment is wondering if we

[jira] [Commented] (TIKA-857) Tika TrueTypeParser add metadata from Naming tables

2012-02-07 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202928#comment-13202928 ] Nick Burch commented on TIKA-857: - Not sure that this issue should have been resolved, as th

[jira] [Commented] (TIKA-847) Add regular expression support to the MagicDetector

2012-02-07 Thread Andrew Jackson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202883#comment-13202883 ] Andrew Jackson commented on TIKA-847: - As far as I can tell, Lucene's FSM engine is so v

[jira] [Resolved] (TIKA-857) Tika TrueTypeParser add metadata from Naming tables

2012-02-07 Thread Craig Stires (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Craig Stires resolved TIKA-857. --- Resolution: Fixed Fix Version/s: 1.1 > Tika TrueTypeParser add metadata from Naming tables >

[jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

2012-02-07 Thread Marek Slama (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202473#comment-13202473 ] Marek Slama commented on TIKA-792: -- We have the same problem. But this time it is Jackrabbi

TF-IDF parser and ContentHandler?

2012-02-07 Thread Mattmann, Chris A (388J)
Hey Guys, I've been toying around with the idea of writing a simple Tika Parser Decorator that extends the Text Parser, but that generates TF-IDF metadata, maybe a top word count (summarized) and a frequencies/term map. I was also thinking of then writing a similar ContentHandler as well so it could
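
For readers unfamiliar with the term, here is a minimal sketch of the kind of term-frequency and TF-IDF calculation such a decorator might feed into metadata. It is not taken from the thread and does not use any Tika API: the class name TfIdfSketch, the method names, the tokenization rule, and the smoothed IDF formula (log((1 + N) / (1 + df)) + 1) are all assumptions chosen for illustration.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration only; not part of Tika.
public class TfIdfSketch {

    // Count how often each term occurs in the text.
    // Assumption: terms are lowercase runs of letters or digits.
    static Map<String, Integer> termCounts(String text) {
        Map<String, Integer> counts = new HashMap<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                counts.merge(token, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Score every term of doc against a corpus of plain-text documents.
    // tf  = occurrences of the term / total terms in doc
    // idf = log((1 + N) / (1 + df)) + 1   (a smoothed variant, assumed here)
    static Map<String, Double> tfIdf(String doc, List<String> corpus) {
        Map<String, Integer> counts = termCounts(doc);
        int total = 0;
        for (int c : counts.values()) {
            total += c;
        }
        Map<String, Double> scores = new HashMap<>();
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            int df = 0; // number of corpus documents containing the term
            for (String other : corpus) {
                if (termCounts(other).containsKey(e.getKey())) {
                    df++;
                }
            }
            double tf = e.getValue() / (double) total;
            double idf = Math.log((1.0 + corpus.size()) / (1.0 + df)) + 1.0;
            scores.put(e.getKey(), tf * idf);
        }
        return scores;
    }

    public static void main(String[] args) {
        List<String> corpus = List.of("tika parser", "tika detector");
        // Terms that appear in fewer corpus documents receive higher scores.
        System.out.println(tfIdf("tika parser", corpus));
    }
}
```

In a real decorator the text would presumably come from the wrapped parser's output rather than a string literal, and the corpus statistics would have to be supplied from outside, since a single parse only ever sees one document.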

[jira] [Commented] (TIKA-853) java.io.IOException with TikaGUI and testMP4.m4a

2012-02-07 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202238#comment-13202238 ] Nick Burch commented on TIKA-853: - We don't want to have a System.gc call in production code