Jiabao Sun created FLINK-31063:
----------------------------------
Summary: Prevent duplicate reading when restoring from a
checkpoint.
Key: FLINK-31063
URL: https://issues.apache.org/jira/browse/FLINK-31063
Project: Flink
Issue Type: Bug
Components: Connectors / MongoDB
Affects Versions: mongodb-1.0.0
Reporter: Jiabao Sun
Fix For: mongodb-1.0.0
Exact-once semantics may not be guaranteed at present on partial reads.
We use a number fetchSize to limit the records count for every fetch loop but
we didn't record the offset into the split state. When resuming the split
reader from a partially completed split, we may re-read some data.
We should record the current reading offset into split state.
Skip this offset when restoring to prevent duplicate reading.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)