[ https://issues.apache.org/jira/browse/TIKA-3721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17526655#comment-17526655 ]
Tim Allison commented on TIKA-3721:
-----------------------------------
Tika's SummaryExtractor (based on POI) works on these files.
{noformat}
// Fragment: assumes imports for java.io.File, java.util.Iterator, POI's
// POIFSFileSystem/DirectoryNode/Entry/DocumentEntry, and Tika's Metadata and
// SummaryExtractor.
for (File f : new File("....tika-dgn-detector/src/test/resources/dgn" +
        "/dgn8").listFiles()) {
    // Open read-only; try-with-resources closes the file handle.
    try (POIFSFileSystem pfs = new POIFSFileSystem(f, true)) {
        DirectoryNode root = pfs.getRoot();
        System.out.println("file: " + f.getName());
        Metadata metadata = new Metadata();
        SummaryExtractor summaryExtractor = new SummaryExtractor(metadata);
        summaryExtractor.parseSummaries(root);
        System.out.println("ENTRIES");
        // List each top-level entry of the container.
        for (Iterator<Entry> it = root.getEntries(); it.hasNext(); ) {
            Entry e = it.next();
            String which = (e instanceof DocumentEntry) ? "document" : "directory";
            System.out.println(e.getName() + " : " + which);
        }
        System.out.println("METADATA");
        debug(metadata); // local helper, not shown
        System.out.println("");
    }
}
{noformat}
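As a side note on why a POI-based extractor can open these files at all: POIFSFileSystem reads OLE2 compound documents, so the dgn8 test files are evidently stored in that container. A minimal, standard-library-only sketch of checking for the OLE2 header signature follows. The class and method names (Ole2Check, looksLikeOle2) are made up for illustration and are not part of Tika or POI, and a signature match only says "OLE2 container", not "DGN".

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;

public class Ole2Check {
    // The 8-byte signature at the start of an OLE2 compound file.
    private static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // Hypothetical helper: true if the buffer starts with the OLE2 signature.
    static boolean looksLikeOle2(byte[] header) {
        return header.length >= OLE2_MAGIC.length
            && Arrays.equals(Arrays.copyOf(header, OLE2_MAGIC.length), OLE2_MAGIC);
    }

    // Hypothetical helper: reads only the first 8 bytes of the file.
    static boolean looksLikeOle2(Path file) throws IOException {
        try (InputStream in = Files.newInputStream(file)) {
            return looksLikeOle2(in.readNBytes(OLE2_MAGIC.length));
        }
    }
}
```

Calling Ole2Check.looksLikeOle2(f.toPath()) before constructing a POIFSFileSystem would be one way to skip files in the test directory that are not OLE2 containers.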
> DGN parser
> ----------
>
> Key: TIKA-3721
> URL: https://issues.apache.org/jira/browse/TIKA-3721
> Project: Tika
> Issue Type: New Feature
> Components: parser
> Affects Versions: 2.3.0
> Reporter: Dan Coldrick
> Priority: Minor
> Attachments: dgn8s-dumped.txt, image-2022-04-22-20-00-45-704.png,
> image-2022-04-22-20-01-09-564.png, image-2022-04-22-20-02-24-180.png
>
>
> Does anyone have any experience with the DGN file format used by MicroStation? I
> see Tika doesn't have a parser for it, so would it be possible to create one?
> https://docs.fileformat.com/cad/dgn/