Giorgiana Ciobanu created TIKA-3811:
---------------------------------------

             Summary: Exclude NameDetector not working for new 
Tika().detect(file)
                 Key: TIKA-3811
                 URL: https://issues.apache.org/jira/browse/TIKA-3811
             Project: Tika
          Issue Type: Bug
          Components: config, core, detector
    Affects Versions: 2.3.0
            Reporter: Giorgiana Ciobanu
         Attachments: invalid_format.vtt, tika-config_test.xml

I need to detect mime type for a file but for security reason I want to exclude 
the detection by file name extension. 

I added a tika-config_test.xml (see attached) to my unit test but it still 
detects file by name extension.

I attached a test file that is wrongly detected as text/vtt because of the file 
extension, it should be text/plain in this case.

 

The code of my unit test:

 

 
{code:java}
File file = new 
File(getClass().getClassLoader().getResource("invalid_format.vtt").getFile());
TikaConfig tikaConfig = new TikaConfig(this.getClass()
.getClassLoader()
.getResourceAsStream("tika-config_test.xml"));
 
// returns text/vtt but should be text/plain
String mimeType = new Tika(tikaConfig).detect(file); 
{code}
 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to