Subbu created TIKA-4365: --------------------------- Summary: Support Android Bundle aab detection Key: TIKA-4365 URL: https://issues.apache.org/jira/browse/TIKA-4365 Project: Tika Issue Type: Bug Components: tika-core Reporter: Subbu
AAB file goes through DefaultZipContainerDetector and gets detected as _application/java-archive_ since it has MANIFEST.MF via JarDetector. They have their own content type as mentioned in tika-mimetypes.xml - application/x-authorware-bin [https://github.com/apache/tika/blob/6a12d7cf1a2ee37dce899933a05520604330c947/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml#L3485] The AAB file structure has a AndroidManifest.xml similar to apk archive but in [base/manifest/|http://base/manifest/] directory. Android Dev documentation reference : [https://developer.android.com/guide/app-bundle/app-bundle-format] Also adding BundleConfig.pb for better accuracy may help. -- This message was sent by Atlassian Jira (v8.20.10#820010)