[ https://issues.apache.org/jira/browse/TIKA-4381 ]


    Tim Allison deleted comment on TIKA-4381:
    -----------------------------------

was (Author: talli...@mitre.org):
Does anyone have any links/resources for the property ids?

I did what I could here: 
[github|https://github.com/apache/tika/blob/TIKA-4381/tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/parser/microsoft/msg/ExtendedMetadataExtractor.java#L43]

Specifically:
{noformat}
    static {
        //TODO -- figure out how these differ and how they overlap with other 
types
        PROPERTIES.put(0x8003, MAPI.APPT_START_TIME);
        PROPERTIES.put(0x8005, MAPI.APPT_START_TIME);
        PROPERTIES.put(0x8007, MAPI.APPT_START_TIME);
        PROPERTIES.put(0x8009, MAPI.APPT_START_TIME);
        PROPERTIES.put(0x801b, MAPI.APPT_START_TIME);

        PROPERTIES.put(0x8004, MAPI.APPT_END_TIME);
        PROPERTIES.put(0x8006, MAPI.APPT_END_TIME);
        PROPERTIES.put(0x801c, MAPI.APPT_END_TIME);
        PROPERTIES.put(0x8015, MAPI.APPT_END_REPEAT_TIME);
    }
{noformat}

I don't see these values here: 
[ms-oxprops|https://learn.microsoft.com/en-us/openspecs/exchange_server_protocols/ms-oxprops/f6ab1613-aefe-447d-a49c-18217230b148
 ]

> Improve extraction of metadata from Appointment/Task msgs
> ---------------------------------------------------------
>
>                 Key: TIKA-4381
>                 URL: https://issues.apache.org/jira/browse/TIKA-4381
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>         Attachments: Parser.java
>
>
> Our metadata extraction on msgs is mostly focused on "NOTE"/regular emails. 
> We could do to improve extraction from appointments, tasks and other msg 
> types.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to