[ https://issues.apache.org/jira/browse/TIKA-627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13014007#comment-13014007 ]
Nick Burch commented on TIKA-627:
---------------------------------

For the XML based ones, we should be able to do some things already, and I guess the main thing would be the definitions to extract out the common metadata elements. For anything that pre-dates the XML formats, we'd likely need a suitable Java library.

First up, could you provide some sample X12 xml files that we can use for testing, along with a list of the metadata you'd expect Tika to be extracting from them?

> Support X12 files
> -----------------
>
>                 Key: TIKA-627
>                 URL: https://issues.apache.org/jira/browse/TIKA-627
>             Project: Tika
>          Issue Type: New Feature
>          Components: mime, parser
>            Reporter: Jukka Zitting
>            Priority: Minor
>
> X12 [1] is a standardized data interchange format. It would be nice if Tika
> could understand such files.
> [1] http://en.wikipedia.org/wiki/ASC_X12

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira