[ 
https://issues.apache.org/jira/browse/TIKA-2068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15473929#comment-15473929
 ] 

Nam-Quang Tran commented on TIKA-2068:
--------------------------------------

Some more information on the issue:

1) The above crash occurred with Tika 1.11, not Tika 1.13. In Tika 1.13, I get 
a similar crash:

Exception in thread "main" java.lang.NullPointerException
at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:89)
at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:170)
at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:577)
at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:69)
at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:218)
at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:341)
at 
org.apache.tika.parser.rtf.RTFEmbObjHandler.getConfig(RTFEmbObjHandler.java:263)
at 
org.apache.tika.parser.rtf.RTFEmbObjHandler.getExtension(RTFEmbObjHandler.java:242)
at 
org.apache.tika.parser.rtf.RTFEmbObjHandler.extractObj(RTFEmbObjHandler.java:219)
at 
org.apache.tika.parser.rtf.RTFEmbObjHandler.handleCompletedObject(RTFEmbObjHandler.java:198)
at 
org.apache.tika.parser.rtf.TextExtractor.processGroupEnd(TextExtractor.java:1357)
at org.apache.tika.parser.rtf.TextExtractor.extract(TextExtractor.java:456)
at org.apache.tika.parser.rtf.TextExtractor.extract(TextExtractor.java:439)
at org.apache.tika.parser.rtf.RTFParser.parse(RTFParser.java:87)
at test.Test.main(Test.java:23)

2) The above crashes occur only when the Tika source code is directly imported 
into a test project (without any modifications). No crash happens with the Tika 
app jar. So it appears as if some configuration is necessary to make things 
work. However, I have not observed this phenomenon with other Tika parsers that 
were also imported as source code files (e.g. MP3 parser, CHM parser).

3) For the sake of completeness, here's the test code I used to crash the RTF 
parser:

{{
public class Test {

        public static void main(String[] args) throws Exception {
                String path = "/media/data/Test/Styrodur C - 2800_C.rtf";
                File file = new File(path);
                InputStream in = new FileInputStream(file);
                BodyContentHandler bodyHandler = new BodyContentHandler(-1);
                Metadata metadata = new Metadata();
                new RTFParser().parse(in, bodyHandler, metadata, new 
ParseContext());
        }

}
}}

> RTFParser crashes with NullPointerException
> -------------------------------------------
>
>                 Key: TIKA-2068
>                 URL: https://issues.apache.org/jira/browse/TIKA-2068
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.13
>            Reporter: Nam-Quang Tran
>         Attachments: Styrodur C - 2800_C.rtf
>
>
> The RTFParser seems to crash on RTF files containing pictures. The attached 
> file produces the following stacktrace:
> java.lang.NullPointerException
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:90)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:174)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:577)
> at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:69)
> at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:218)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:341)
> at 
> org.apache.tika.parser.rtf.RTFEmbObjHandler.getConfig(RTFEmbObjHandler.java:263)
> at 
> org.apache.tika.parser.rtf.RTFEmbObjHandler.getExtension(RTFEmbObjHandler.java:242)
> at 
> org.apache.tika.parser.rtf.RTFEmbObjHandler.extractObj(RTFEmbObjHandler.java:219)
> at 
> org.apache.tika.parser.rtf.RTFEmbObjHandler.handleCompletedObject(RTFEmbObjHandler.java:198)
> at 
> org.apache.tika.parser.rtf.TextExtractor.processGroupEnd(TextExtractor.java:1357)
> at org.apache.tika.parser.rtf.TextExtractor.extract(TextExtractor.java:456)
> at org.apache.tika.parser.rtf.TextExtractor.extract(TextExtractor.java:439)
> at org.apache.tika.parser.rtf.RTFParser.parse(RTFParser.java:87)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to