Luca Occhipinti created FLINK-39169:
---------------------------------------
Summary: [mysql-connector] Use reader instances to run snapshots
in clustered instances
Key: FLINK-39169
URL: https://issues.apache.org/jira/browse/FLINK-39169
Project: Flink
Issue Type: Improvement
Components: Flink CDC
Reporter: Luca Occhipinti
When running MySQL CDC in snapshot or initial mode (both streaming and batch)
In cloud environments like AWS Aurora/RDS, the connector requires *write
access* to the primary database instance to retrieve the binlog position and
then continues running snapshot queries.
This creates unnecessary load on the primary/writer instance when performing
large snapshot reads, which can impact production workloads.
Usually this there are read replicas specifically designed to offload read
traffic.
However, the current implementation cannot leverage these replicas for snapshot
data reading.
The proposal is to use writer instance to get binlog position, use the reader
replica to run the snapshot queries, and if running in streaming mode, keep
using the writer to track binlog changes
--
This message was sent by Atlassian Jira
(v8.20.10#820010)