Jiabao Sun created FLINK-31063:
----------------------------------

             Summary: Prevent duplicate reading when restoring from a checkpoint.
                 Key: FLINK-31063
                 URL: https://issues.apache.org/jira/browse/FLINK-31063
             Project: Flink
          Issue Type: Bug
          Components: Connectors / MongoDB
    Affects Versions: mongodb-1.0.0
            Reporter: Jiabao Sun
             Fix For: mongodb-1.0.0


Exactly-once semantics are currently not guaranteed when a split has been only 
partially read. The split reader uses fetchSize to limit the number of records 
returned by each fetch loop, but it does not record the read offset in the 
split state. When the split reader resumes from a partially completed split 
after a restore, it may re-read records that were already emitted.

We should record the current read offset in the split state and, when 
restoring from a checkpoint, skip the records before that offset to prevent 
duplicate reading.
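
A minimal sketch of the idea, not the connector's actual code: SplitState and 
OffsetTrackingReader are hypothetical names, and an in-memory list stands in 
for the MongoDB cursor. The offset is kept in the split state, so a snapshot 
taken at a checkpoint carries it, and a restored reader skips what was already 
emitted.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical split state: the split's records plus the count already emitted. */
final class SplitState {
    final List<String> records; // assumption: stands in for a cursor over the split
    int offset;                 // records already emitted; stored with the checkpoint

    SplitState(List<String> records, int offset) {
        this.records = records;
        this.offset = offset;
    }

    /** Copy of the state as it would be written into a checkpoint. */
    SplitState snapshot() {
        return new SplitState(records, offset);
    }
}

/** Hypothetical reader that fetches at most fetchSize records per loop. */
final class OffsetTrackingReader {
    private final SplitState state;
    private final int fetchSize;

    OffsetTrackingReader(SplitState state, int fetchSize) {
        this.state = state;
        this.fetchSize = fetchSize;
    }

    /** Returns the next batch, starting after the recorded offset. */
    List<String> fetch() {
        int from = Math.min(state.offset, state.records.size());
        int to = Math.min(from + fetchSize, state.records.size());
        List<String> batch = new ArrayList<>(state.records.subList(from, to));
        state.offset = to; // record progress so a restore can skip these records
        return batch;
    }
}
```

If a checkpoint is taken after the first fetch and the job later fails, a 
reader built from the snapshot continues with the record after the recorded 
offset instead of starting the split again from the beginning.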



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
