Stephen H created TIKA-4461:
-------------------------------

             Summary: In RFC822Parser support Content-Id for parts
                 Key: TIKA-4461
                 URL: https://issues.apache.org/jira/browse/TIKA-4461
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 3.2.1
            Reporter: Stephen H
         Attachments: mail-parser-patch.txt

Currently RFC822Parser won't store in the metadata for a message part the 
part's Content-Id field. This means it's not possible to relate a cid: URL in 
message HTML to the part that has that content.

The attached patch adds a new MESSAGE_CONTENT_ID property to the Message 
properties and then MailContentHandler adds this from the part field when it is 
present, normally just for inline parts.

Modified an existing test to check for this which is hopefully okay.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to