[ 
https://issues.apache.org/jira/browse/TIKA-4314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Leszek Sliwko updated TIKA-4314:
--------------------------------
    Attachment: CompositeParser.java

Hi, please check the attached changes to CompositeParser (lines: 96-137).
I imagine something like (but it doesn't work correctly yet).








-- 
Leszek Sliwko
System Designer
https://www.linkedin.com/in/leszeksliwko/


> CompositeParser returns only one parser per content type
> --------------------------------------------------------
>
>                 Key: TIKA-4314
>                 URL: https://issues.apache.org/jira/browse/TIKA-4314
>             Project: Tika
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 2.9.2
>            Reporter: Leszek Sliwko
>            Priority: Major
>         Attachments: CompositeParser.java, duration-test-2.avi, 
> geolocation-test-1.jpg, geolocation-test-2.jpg
>
>
> External parsers can have many supported content types, but information is 
> lost in CompositeParser:
>  
> public Map<MediaType, Parser> getParsers(ParseContext context) {
>   Map<MediaType, Parser> map = new HashMap<>();
>   for (Parser parser : parsers) {
>     for (MediaType type : parser.getSupportedTypes(context))
> {        map.put(registry.normalize(type), parser); }
>    }
>    return map;
> }
>  
> To recreate - parse any avi file (content type: video/x-msvideo), Only the 
> exiftool will by picked up and the ffmpeg parser won't be executed.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to