[ https://issues.apache.org/jira/browse/TIKA-4314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17884593#comment-17884593 ]
Leszek Sliwko commented on TIKA-4314: ------------------------------------- I have another example with the attached files - both contain geolocation data that is correctly detected by ExifTool. However, because parsing only occurs in {{{}org.apache.tika.parser.gdal.GDALParser{}}}, the metadata is lost (the GDAL parser does not extract geolocation). I also recall this working fine in 2019. > CompositeParser returns only one parser per content type > -------------------------------------------------------- > > Key: TIKA-4314 > URL: https://issues.apache.org/jira/browse/TIKA-4314 > Project: Tika > Issue Type: Bug > Components: core > Affects Versions: 2.9.2 > Reporter: Leszek Sliwko > Priority: Major > Attachments: geolocation-test-1.jpg, geolocation-test-2.jpg > > > External parsers can have many supported content types, but information is > lost in CompositeParser: > > public Map<MediaType, Parser> getParsers(ParseContext context) { > Map<MediaType, Parser> map = new HashMap<>(); > for (Parser parser : parsers) { > for (MediaType type : parser.getSupportedTypes(context)) > { map.put(registry.normalize(type), parser); } > } > return map; > } > > To recreate - parse any avi file (content type: video/x-msvideo), Only the > exiftool will by picked up and the ffmpeg parser won't be executed. -- This message was sent by Atlassian Jira (v8.20.10#820010)