Hi there!

I have a question about changelog stream processing with Flink SQL and the
Flink Table API. I would like to better understand how Flink uses
UPDATE_BEFORE rows.

Our team uses Debezium to extract Change Data Capture events from MySQL
databases. We currently redact the `before` field in the envelope [0] so
that PII which has been redacted does not persist in our Kafka topics
through the `before` field of UPDATE events.
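
To make the setup concrete, here is a minimal sketch of what the redaction
amounts to. The event contents and the `redact_before` helper are
hypothetical and only illustrate the shape of the envelope. This is not our
actual pipeline code, and real Debezium envelopes carry more fields (for
example `source` and `ts_ms`).

```python
import copy


def redact_before(envelope):
    """Return a copy of a Debezium-style envelope with `before` set to None.

    Hypothetical helper, used only to illustrate the redaction.
    """
    redacted = copy.deepcopy(envelope)
    redacted["before"] = None
    return redacted


# Illustrative UPDATE event ("op": "u"); the values are made up.
update_event = {
    "op": "u",
    "before": {"id": 1, "email": "old@example.com"},
    "after": {"id": 1, "email": "new@example.com"},
}

redacted_event = redact_before(update_event)
# The old row image is gone; only the new image remains.
print(redacted_event)
```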

As a result, if we consumed these CDC streams with Flink, UPDATE events
would arrive without UPDATE_BEFORE data. What impact, if any, would this
have on performance and correctness? Are there any other considerations we
should be aware of?
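
For reference, this is roughly how I understand the mapping from Debezium
operations to changelog row kinds. It is a plain-Python sketch, not Flink
code: `to_changelog_rows` is a hypothetical name, and the sketch
deliberately does not model how Flink reacts to a missing `before` image,
since that is exactly what I am asking about.

```python
def to_changelog_rows(envelope):
    """Map a Debezium-style envelope to (row kind, row image) pairs.

    Row kinds use the short names +I, -U, +U, -D
    (INSERT, UPDATE_BEFORE, UPDATE_AFTER, DELETE).
    """
    op = envelope["op"]
    before = envelope.get("before")
    after = envelope.get("after")
    if op in ("c", "r"):  # create, or snapshot read
        return [("+I", after)]
    if op == "u":  # update: retract the old image, emit the new one
        return [("-U", before), ("+U", after)]
    if op == "d":  # delete
        return [("-D", before)]
    raise ValueError(f"unknown op: {op!r}")


# An UPDATE event whose `before` image was redacted upstream (made-up values).
redacted_update = {
    "op": "u",
    "before": None,
    "after": {"id": 1, "email": "new@example.com"},
}

rows = to_changelog_rows(redacted_update)
# The -U row has no payload, so nothing identifies which old row to retract.
print(rows)
```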

Thanks in advance for your help!


[0] https://debezium.io/documentation/reference/stable/connectors/mysql.html
