[ 
https://issues.apache.org/jira/browse/TIKA-4314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17884595#comment-17884595
 ] 

Tim Allison edited comment on TIKA-4314 at 9/25/24 12:31 PM:
-------------------------------------------------------------

Do you want GDAL and exiftool for the jpgs? If so, the fix will require more 
than modifying the CompositeExternalParser... which is fine... let me see what 
I can do with what we have.

Example files and use cases are extremely helpful. Thank you!


was (Author: talli...@mitre.org):
Do you want GDAL and exiftool for the jpgs? If so, the fix will require more 
than modifying the CompositeExternalParser. 

Example files and use cases are extremely helpful. Thank you!

> CompositeParser returns only one parser per content type
> --------------------------------------------------------
>
>                 Key: TIKA-4314
>                 URL: https://issues.apache.org/jira/browse/TIKA-4314
>             Project: Tika
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 2.9.2
>            Reporter: Leszek Sliwko
>            Priority: Major
>         Attachments: duration-test-2.avi, geolocation-test-1.jpg, 
> geolocation-test-2.jpg
>
>
> External parsers can have many supported content types, but information is 
> lost in CompositeParser:
>  
> public Map<MediaType, Parser> getParsers(ParseContext context) {
>   Map<MediaType, Parser> map = new HashMap<>();
>   for (Parser parser : parsers) {
>     for (MediaType type : parser.getSupportedTypes(context)) {
>       // put() replaces any parser already stored for this type
>       map.put(registry.normalize(type), parser);
>     }
>   }
>   return map;
> }
>  
> To reproduce: parse any AVI file (content type: video/x-msvideo). Only the 
> exiftool parser will be picked up, and the ffmpeg parser won't be executed.
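
The behaviour described above can be reproduced without Tika. The following is a minimal sketch, not Tika code: parsers and media types are plain strings, and the class name, the method names (lastWins, keepAll) and the registration order are all assumptions made for illustration. It shows why a Map<MediaType, Parser> filled with put() ends up with one parser per type, and one possible shape for a lookup that keeps all of them.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for the lookup in the issue: parsers and media types
// are plain strings here, not Tika's Parser and MediaType classes.
public class ParserMapSketch {

    // Mirrors the quoted loop: put() replaces any parser already stored for a type,
    // so only the last parser registered for each type survives.
    static Map<String, String> lastWins(Map<String, List<String>> typesByParser) {
        Map<String, String> map = new HashMap<>();
        for (Map.Entry<String, List<String>> entry : typesByParser.entrySet()) {
            for (String type : entry.getValue()) {
                map.put(type, entry.getKey());
            }
        }
        return map;
    }

    // One possible alternative (an assumption, not the actual fix):
    // collect every parser that supports a type, in registration order.
    static Map<String, List<String>> keepAll(Map<String, List<String>> typesByParser) {
        Map<String, List<String>> map = new HashMap<>();
        for (Map.Entry<String, List<String>> entry : typesByParser.entrySet()) {
            for (String type : entry.getValue()) {
                map.computeIfAbsent(type, k -> new ArrayList<>()).add(entry.getKey());
            }
        }
        return map;
    }

    public static void main(String[] args) {
        // Assumed registration order: ffmpeg first, exiftool second.
        Map<String, List<String>> typesByParser = new LinkedHashMap<>();
        typesByParser.put("ffmpeg", List.of("video/x-msvideo"));
        typesByParser.put("exiftool", List.of("video/x-msvideo", "image/jpeg"));

        System.out.println(lastWins(typesByParser).get("video/x-msvideo"));
        System.out.println(keepAll(typesByParser).get("video/x-msvideo"));
    }
}
```

With the assumed order, the first lookup yields only exiftool for video/x-msvideo, matching the report, while the second keeps both ffmpeg and exiftool. Whether the eventual fix takes this form is not stated in the thread.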



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
