[ 
https://issues.apache.org/jira/browse/TIKA-4384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17926417#comment-17926417
 ] 

Tim Allison commented on TIKA-4384:
-----------------------------------

Looks good!
{noformat}
[{"X-TIKA:Parsed-By":["org.apache.tika.parser.DefaultParser","org.apache.tika.parser.pkg.PackageParser"],"X-TIKA:Parsed-By-Full-Set":["org.apache.tika.parser.DefaultParser","org.apache.tika.parser.pkg.PackageParser","org.apache.tika.parser.csv.TextAndCSVParser"],"X-TIKA:content_handler":"ToTextContentHandler","X-TIKA:parse_time_millis":"67","X-TIKA:embedded_depth":"0","X-TIKA:content":"\n\n\n\n\n\n\n\n\n\n\n\ntestzipxtestdemo.txt\n\n","resourceName":"demozipxfile.zipx","X-TIKA:detectedEncoding":"ISO-8859-1","Content-Length":"143","X-TIKA:encodingDetector":"UniversalEncodingDetector","Content-Type":"application/zip"},{"X-TIKA:final_embedded_resource_path":"/testzipxtestdemo.txt","X-TIKA:embedded_id_path":"/1","X-TIKA:content_handler":"ToTextContentHandler","resourceName":"testzipxtestdemo.txt","dcterms:modified":"2025-02-12T15:24:10Z","X-TIKA:encodingDetector":"UniversalEncodingDetector","embeddedRelationshipId":"testzipxtestdemo.txt","X-TIKA:Parsed-By":["org.apache.tika.parser.DefaultParser","org.apache.tika.parser.csv.TextAndCSVParser"],"X-TIKA:embedded_depth":"1","Content-Encoding":"ISO-8859-1","X-TIKA:parse_time_millis":"15","X-TIKA:content":"\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nTest\n\n","X-TIKA:detectedEncoding":"ISO-8859-1","Content-Length":"5","X-TIKA:embedded_resource_path":"/testzipxtestdemo.txt","X-TIKA:embedded_id":"1","Content-Type":"text/plain;
 charset=ISO-8859-1"}]
{noformat}

> ZipX not supported in DefaultMimeTypes
> --------------------------------------
>
>                 Key: TIKA-4384
>                 URL: https://issues.apache.org/jira/browse/TIKA-4384
>             Project: Tika
>          Issue Type: Improvement
>          Components: tika-core
>            Reporter: Subbu
>            Priority: Minor
>         Attachments: demozipxfile.zipx
>
>
> ZipX are a specific file extensions that are like Zip files. DefaultMimeTypes 
> not support them causing file name based zip content are detected as 
> application/octet-stream.
> Creating a PR to fix this with test



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to