Jiabao Sun created FLINK-31063: ---------------------------------- Summary: Prevent duplicate reading when restoring from a checkpoint. Key: FLINK-31063 URL: https://issues.apache.org/jira/browse/FLINK-31063 Project: Flink Issue Type: Bug Components: Connectors / MongoDB Affects Versions: mongodb-1.0.0 Reporter: Jiabao Sun Fix For: mongodb-1.0.0
Exact-once semantics may not be guaranteed at present on partial reads. We use a number fetchSize to limit the records count for every fetch loop but we didn't record the offset into the split state. When resuming the split reader from a partially completed split, we may re-read some data. We should record the current reading offset into split state. Skip this offset when restoring to prevent duplicate reading. -- This message was sent by Atlassian Jira (v8.20.10#820010)