Runkang He created FLINK-36683:
----------------------------------

             Summary: Support metadata 'row_kind' virtual column for Mongo CDC Connector
                 Key: FLINK-36683
                 URL: https://issues.apache.org/jira/browse/FLINK-36683
             Project: Flink
          Issue Type: Improvement
          Components: Flink CDC
    Affects Versions: cdc-3.3.0, cdc-3.2.1
            Reporter: Runkang He

The 'row_kind' metadata column is very useful in real user scenarios. The two main scenarios are:

1. Save all upstream messages: The downstream system stores every message from upstream, including delete messages. To achieve this, all kinds of changelog records must be converted to append-only messages, with the 'row_kind' metadata column recording the original changelog kind.
2. Ignore upstream delete messages: The upstream CDC source often deletes historical data on a regular basis to save storage space and keeps only the most recent seven days of data. However, the business requires the downstream OLAP system to retain the full history, so the delete messages from the source must be ignored.

So I think we should support the 'row_kind' metadata column in the Mongo CDC Connector.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
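
As a rough illustration of the two scenarios, here is a minimal Python sketch. It is not connector code and does not use any Flink API. The `ChangeEvent` class and both helper functions are hypothetical, and the sketch assumes that 'row_kind' would carry Flink's RowKind short strings (`+I`, `-U`, `+U`, `-D`).

```python
from dataclasses import dataclass
from typing import Iterable, List

# Flink's RowKind short strings; assumed here to be the values 'row_kind' exposes.
INSERT, UPDATE_BEFORE, UPDATE_AFTER, DELETE = "+I", "-U", "+U", "-D"


@dataclass(frozen=True)
class ChangeEvent:
    """Hypothetical changelog record: a row kind plus the document payload."""
    kind: str
    doc: dict


def to_append_only(events: Iterable[ChangeEvent]) -> List[dict]:
    """Scenario 1: keep every event and store its kind as an ordinary column."""
    return [{**e.doc, "row_kind": e.kind} for e in events]


def drop_deletes(events: Iterable[ChangeEvent]) -> List[ChangeEvent]:
    """Scenario 2: forward everything except delete events."""
    return [e for e in events if e.kind != DELETE]


events = [
    ChangeEvent(INSERT, {"_id": 1, "status": "new"}),
    ChangeEvent(UPDATE_BEFORE, {"_id": 1, "status": "new"}),
    ChangeEvent(UPDATE_AFTER, {"_id": 1, "status": "paid"}),
    ChangeEvent(DELETE, {"_id": 1, "status": "paid"}),
]

append_only_rows = to_append_only(events)  # all four events survive as plain rows
retained_events = drop_deletes(events)     # the delete event is filtered out
```

In both cases the downstream logic depends only on the row kind being readable as a regular value, which is what the requested metadata column would provide.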