[ https://issues.apache.org/jira/browse/TIKA-4381 ]
Tim Allison deleted comment on TIKA-4381: ----------------------------------- was (Author: talli...@mitre.org): Does anyone have any links/resources for the property ids? I did what I could here: [github|https://github.com/apache/tika/blob/TIKA-4381/tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/msg/ExtendedMetadataExtractor.java#L43] Specifically: {noformat} static { //TODO -- figure out how these differ and how they overlap with other types PROPERTIES.put(0x8003, MAPI.APPT_START_TIME); PROPERTIES.put(0x8005, MAPI.APPT_START_TIME); PROPERTIES.put(0x8007, MAPI.APPT_START_TIME); PROPERTIES.put(0x8009, MAPI.APPT_START_TIME); PROPERTIES.put(0x801b, MAPI.APPT_START_TIME); PROPERTIES.put(0x8004, MAPI.APPT_END_TIME); PROPERTIES.put(0x8006, MAPI.APPT_END_TIME); PROPERTIES.put(0x801c, MAPI.APPT_END_TIME); PROPERTIES.put(0x8015, MAPI.APPT_END_REPEAT_TIME); } {noformat} I don't see these values here: [ms-oxprops|https://learn.microsoft.com/en-us/openspecs/exchange_server_protocols/ms-oxprops/f6ab1613-aefe-447d-a49c-18217230b148 ] > Improve extraction of metadata from Appointment/Task msgs > --------------------------------------------------------- > > Key: TIKA-4381 > URL: https://issues.apache.org/jira/browse/TIKA-4381 > Project: Tika > Issue Type: Task > Reporter: Tim Allison > Priority: Major > Attachments: Parser.java > > > Our metadata extraction on msgs is mostly focused on "NOTE"/regular emails. > We could do to improve extraction from appointments, tasks and other msg > types. -- This message was sent by Atlassian Jira (v8.20.10#820010)