danzhewuju opened a new pull request, #4297:
URL: https://github.com/apache/flink-cdc/pull/4297

   Fix: https://issues.apache.org/jira/browse/FLINK-39198
   
   Flink CDC creates an eventFilter inside the MySqlBinlogSplitReadTask.  
   This filter is produced through the following method. In timestamp 
startOptions mode, the eventFilter will exclude events whose timestamps are 
earlier than the user-defined timestamp. However, if the job resumes from a 
savepoint or checkpoint and the system automatically sets the timestamp 
parameter, this approach can result in data loss.
   
   In this scenario, the user configures a ⁠startOption with timestamp mode 
(which may be the default behavior, tied to the Flink job’s submission time) 
and restarts the Flink CDC job from a savepoint. If the savepoint contains 
MySQL binlog information including a ⁠ts_sec value, and that ⁠ts_sec is earlier 
than the user-defined timestamp, data loss may occur.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to