Runkang He created FLINK-36573:
----------------------------------

             Summary: Add table information of binlog offsets when checkpointing
                 Key: FLINK-36573
                 URL: https://issues.apache.org/jira/browse/FLINK-36573
             Project: Flink
          Issue Type: Improvement
          Components: Flink CDC
    Affects Versions: cdc-3.2.1
            Reporter: Runkang He
             Fix For: cdc-3.2.1


For mysql binlog split, currently when checkpointing, we only print the binlog 
offset in logs, and difficult to know which table the offset belongs to, 
especially when there are multiple mysql cdc sources in one job. 
{code:java}
2024-09-27 18:16:10,488 INFO  
org.apache.flink.cdc.connectors.mysql.source.reader.MySqlSourceReader [] - 
Binlog offset on checkpoint 4: {transaction_id=null, ts_sec=0, 
file=mysql-binlog.000666, pos=147223270, kind=SPECIFIC, 
gtids=e6e6e8e3-6351-11ef-8340-52540bd1ce99:81433397-81559559, row=0, event=20, 
server_id=1}
{code}

So it's very useful to add table information of binlog offsets, to help user 
clearly see the binlog reading progress of each table. Another benefit is that 
when the job topology modification causes job status incompatibility, users can 
use the binlog file and position information here to restart from the 
specific-offsets mode to recover.

{code:java}
2024-09-27 18:16:10,488 INFO  
org.apache.flink.cdc.connectors.mysql.source.reader.MySqlSourceReader [] - 
Binlog offset for tables [test_db.test] on checkpoint 4: {transaction_id=null, 
ts_sec=0, file=mysql-binlog.000666, pos=147223270, kind=SPECIFIC, 
gtids=e6e6e8e3-6351-11ef-8340-52540bd1ce99:81433397-81559559, row=0, event=20, 
server_id=1}
{code}




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to