[ https://issues.apache.org/jira/browse/TIKA-2068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15473929#comment-15473929 ]
Nam-Quang Tran commented on TIKA-2068: -------------------------------------- Some more information on the issue: 1) The above crash occurred with Tika 1.11, not Tika 1.13. In Tika 1.13, I get a similar crash: Exception in thread "main" java.lang.NullPointerException at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:89) at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:170) at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:577) at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:69) at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:218) at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:341) at org.apache.tika.parser.rtf.RTFEmbObjHandler.getConfig(RTFEmbObjHandler.java:263) at org.apache.tika.parser.rtf.RTFEmbObjHandler.getExtension(RTFEmbObjHandler.java:242) at org.apache.tika.parser.rtf.RTFEmbObjHandler.extractObj(RTFEmbObjHandler.java:219) at org.apache.tika.parser.rtf.RTFEmbObjHandler.handleCompletedObject(RTFEmbObjHandler.java:198) at org.apache.tika.parser.rtf.TextExtractor.processGroupEnd(TextExtractor.java:1357) at org.apache.tika.parser.rtf.TextExtractor.extract(TextExtractor.java:456) at org.apache.tika.parser.rtf.TextExtractor.extract(TextExtractor.java:439) at org.apache.tika.parser.rtf.RTFParser.parse(RTFParser.java:87) at test.Test.main(Test.java:23) 2) The above crashes occur only when the Tika source code is directly imported into a test project (without any modifications). No crash happens with the Tika app jar. So it appears as if some configuration is necessary to make things work. However, I have not observed this phenomenon with other Tika parsers that were also imported as source code files (e.g. MP3 parser, CHM parser). 3) For the sake of completeness, here's the test code I used to crash the RTF parser: {{ public class Test { public static void main(String[] args) throws Exception { String path = "/media/data/Test/Styrodur C - 2800_C.rtf"; File file = new File(path); InputStream in = new FileInputStream(file); BodyContentHandler bodyHandler = new BodyContentHandler(-1); Metadata metadata = new Metadata(); new RTFParser().parse(in, bodyHandler, metadata, new ParseContext()); } } }} > RTFParser crashes with NullPointerException > ------------------------------------------- > > Key: TIKA-2068 > URL: https://issues.apache.org/jira/browse/TIKA-2068 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.13 > Reporter: Nam-Quang Tran > Attachments: Styrodur C - 2800_C.rtf > > > The RTFParser seems to crash on RTF files containing pictures. The attached > file produces the following stacktrace: > java.lang.NullPointerException > at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:90) > at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:174) > at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:577) > at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:69) > at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:218) > at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:341) > at > org.apache.tika.parser.rtf.RTFEmbObjHandler.getConfig(RTFEmbObjHandler.java:263) > at > org.apache.tika.parser.rtf.RTFEmbObjHandler.getExtension(RTFEmbObjHandler.java:242) > at > org.apache.tika.parser.rtf.RTFEmbObjHandler.extractObj(RTFEmbObjHandler.java:219) > at > org.apache.tika.parser.rtf.RTFEmbObjHandler.handleCompletedObject(RTFEmbObjHandler.java:198) > at > org.apache.tika.parser.rtf.TextExtractor.processGroupEnd(TextExtractor.java:1357) > at org.apache.tika.parser.rtf.TextExtractor.extract(TextExtractor.java:456) > at org.apache.tika.parser.rtf.TextExtractor.extract(TextExtractor.java:439) > at org.apache.tika.parser.rtf.RTFParser.parse(RTFParser.java:87) -- This message was sent by Atlassian JIRA (v6.3.4#6332)