[ https://issues.apache.org/jira/browse/TIKA-3999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17725530#comment-17725530 ]
Gregory Lepore commented on TIKA-3999:
--------------------------------------
I'm all for increasing the accuracy of format identification, hence the large
effort to document hundreds of MOD formats. However, most of them have poorly
documented and highly variable format structures, so I'm not sure you would be
able to pull much information out of the files without finding the original
documentation or reverse engineering the formats. And since these formats are
already the better part of 25 years old...

That being said, identification is the first step toward content and metadata
extraction, plus it would probably minimize false positives on other files.

I could see implementing identification of the files without worrying too
much about pulling anything out of them. Not my call, just trying to help out
in my areas of expertise. (Plus, Tim's title for the issue was a bit, um,
sparse!)
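
To make "identification only" concrete, here is a minimal sketch of signature
sniffing for the two types named in the issue title. It is an illustration
under stated assumptions, not Tika's detector or its tika-mimetypes.xml
entries: the function name sniff_module and the constant names are
hypothetical, and the tag list covers only a few common ProTracker-style
variants out of the many MOD flavours mentioned above.

```python
from typing import Optional

# Assumed signatures for illustration; real-world MOD variants are far more varied.
XM_MAGIC = b"Extended Module: "  # FastTracker 2 XM header text at offset 0

# ProTracker-style layout: 20-byte title + 31 sample headers * 30 bytes
# + 1 length byte + 1 restart byte + 128-byte order table = 1080 bytes.
MOD_TAG_OFFSET = 1080
MOD_TAGS = {b"M.K.", b"M!K!", b"FLT4", b"FLT8", b"4CHN", b"6CHN", b"8CHN"}


def sniff_module(data: bytes) -> Optional[str]:
    """Return a media type for a recognised tracker module, else None.

    Identification only: nothing is parsed beyond the signature bytes.
    """
    if data.startswith(XM_MAGIC):
        return "audio/xm"
    # Slicing past the end of a short buffer yields b"", which matches no tag.
    if data[MOD_TAG_OFFSET:MOD_TAG_OFFSET + 4] in MOD_TAGS:
        return "audio/x-mod"
    return None
```

A check like this only needs the first 1084 bytes of a file, and a tag sitting
at a fixed deep offset is the kind of signature that helps keep such files
from being misidentified as something else.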
> audio/xm audio/x-mod
> --------------------
>
> Key: TIKA-3999
> URL: https://issues.apache.org/jira/browse/TIKA-3999
> Project: Tika
> Issue Type: Sub-task
> Reporter: Tim Allison
> Priority: Major
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)