Subbu created TIKA-4365:
---------------------------

             Summary: Support Android Bundle aab detection
                 Key: TIKA-4365
                 URL: https://issues.apache.org/jira/browse/TIKA-4365
             Project: Tika
          Issue Type: Bug
          Components: tika-core
            Reporter: Subbu


AAB file goes through DefaultZipContainerDetector and gets detected as

_application/java-archive_ since it has MANIFEST.MF via JarDetector. 

They have their own content type as mentioned in tika-mimetypes.xml  - 
application/x-authorware-bin

[https://github.com/apache/tika/blob/6a12d7cf1a2ee37dce899933a05520604330c947/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml#L3485]


The AAB file structure has a AndroidManifest.xml similar to apk archive but in 
[base/manifest/|http://base/manifest/] directory.

Android Dev documentation reference : 
[https://developer.android.com/guide/app-bundle/app-bundle-format]

Also adding BundleConfig.pb for better accuracy may help.







 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to