[ https://issues.apache.org/jira/browse/FLINK-36611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ruan Hang updated FLINK-36611:
------------------------------
    Fix Version/s: cdc-3.4.0
                       (was: cdc-3.3.0)

> Add schema info to output of Kafka sink
> -----------------------------------------
>
>                 Key: FLINK-36611
>                 URL: https://issues.apache.org/jira/browse/FLINK-36611
>             Project: Flink
>          Issue Type: New Feature
>          Components: Flink CDC
>    Affects Versions: cdc-3.3.0
>            Reporter: Yanquan Lv
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: cdc-3.4.0
>
>
> Currently, the output of the Kafka sink in debezium format looks like this:
> {code:java}
> {
>   "before": {
>     "id": 4,
>     "name": "John",
>     "address": "New York",
>     "phone_number": "2222",
>     "age": 12
>   },
>   "after": {
>     "id": 4,
>     "name": "John",
>     "address": "New York",
>     "phone_number": "1234",
>     "age": 12
>   },
>   "op": "u",
>   "source": {
>     "db": null,
>     "table": "customers"
>   }
> } {code}
> It contains the full before/after record data and db info, but no schema
> info.
> However, in some scenarios we need this information to determine the data
> types. For example, Paimon's Kafka CDC source requires this type information;
> otherwise all types are treated as String. See
> [https://paimon.apache.org/docs/0.9/flink/cdc-ingestion/kafka-cdc/#supported-formats].
> Because including the schema increases the data volume, I suggest adding a
> parameter to configure whether it is enabled.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
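
To make the request in the issue concrete, here is a minimal Python sketch of what an opt-in schema could look like. It assumes the schema-bearing output would follow the Kafka Connect JSON converter convention of a top-level "schema" / "payload" envelope. The include_schema flag, the helper names, and the column types for the customers table are all hypothetical; the issue does not name the option or state the types, and this is not the Flink CDC implementation.

```python
import json

# Hypothetical column types for the "customers" table; the issue does not state them.
CUSTOMER_COLUMNS = {
    "id": "int32",
    "name": "string",
    "address": "string",
    "phone_number": "string",
    "age": "int32",
}

# The update record shown in the issue.
EXAMPLE_RECORD = {
    "before": {"id": 4, "name": "John", "address": "New York",
               "phone_number": "2222", "age": 12},
    "after": {"id": 4, "name": "John", "address": "New York",
              "phone_number": "1234", "age": 12},
    "op": "u",
    "source": {"db": None, "table": "customers"},
}


def row_schema(columns, field_name):
    """Build a Kafka Connect style struct schema for one row image."""
    return {
        "type": "struct",
        "optional": True,
        "field": field_name,
        "fields": [
            {"type": col_type, "optional": True, "field": col_name}
            for col_name, col_type in columns.items()
        ],
    }


def to_debezium_json(record, columns, include_schema=False):
    """Serialize a change record, optionally wrapped in a schema/payload envelope.

    include_schema is a hypothetical stand-in for the parameter the issue proposes.
    """
    if not include_schema:
        # Current behaviour described in the issue: the bare record only.
        return json.dumps(record)
    # Simplified schema: only before, after, and op are described here.
    schema = {
        "type": "struct",
        "optional": False,
        "fields": [
            row_schema(columns, "before"),
            row_schema(columns, "after"),
            {"type": "string", "optional": False, "field": "op"},
        ],
    }
    return json.dumps({"schema": schema, "payload": record})


if __name__ == "__main__":
    print(to_debezium_json(EXAMPLE_RECORD, CUSTOMER_COLUMNS, include_schema=True))
```

With the flag off, the output is the same record the issue shows. With it on, every message also carries the column types, which is the extra data volume the issue mentions and the reason for making it configurable.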