[ https://issues.apache.org/jira/browse/TIKA-4314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17893842#comment-17893842 ]
Tim Allison commented on TIKA-4314: ----------------------------------- Sorry for dropping the ball on this. The SupplementingParser is definitely the way to go with this. I looked at it this morning and we haven't wired up the serialization/configuration so that you can easily specify which component parsers go into that Parser. If we did that, then we would be able to configure {{o.a.t.p.external2.ExternalParser}}s for each commandline you wanted. If you don't need to configure this via xml, e.g. you're running Tika programmatically, this should be not too hard. > CompositeParser returns only one parser per content type > -------------------------------------------------------- > > Key: TIKA-4314 > URL: https://issues.apache.org/jira/browse/TIKA-4314 > Project: Tika > Issue Type: Bug > Components: core > Affects Versions: 2.9.2 > Reporter: Leszek Sliwko > Priority: Major > Attachments: duration-test-2.avi, geolocation-test-1.jpg, > geolocation-test-2.jpg > > > External parsers can have many supported content types, but information is > lost in CompositeParser: > > public Map<MediaType, Parser> getParsers(ParseContext context) { > Map<MediaType, Parser> map = new HashMap<>(); > for (Parser parser : parsers) { > for (MediaType type : parser.getSupportedTypes(context)) > { map.put(registry.normalize(type), parser); } > } > return map; > } > > To recreate - parse any avi file (content type: video/x-msvideo), Only the > exiftool will by picked up and the ffmpeg parser won't be executed. -- This message was sent by Atlassian Jira (v8.20.10#820010)