pnowojski opened a new pull request #14807:
URL: https://github.com/apache/flink/pull/14807


   This PR depends on https://github.com/apache/flink/pull/14797
   
   This PR fixes two bugs. First:
   ```
       If previous checkpoint is declined, it can happen that task receives 
both older and newer
       checkpoint barrier on two different channels, before processing any 
checkpoint cancellation
       message/RPC. If the newer checkpoint barrier happens to be processed 
before the obsolete one
       incorrect `checkState` in ChannelStatePersister would cause job failure. 
This checkState
       was assuming that the previous checkpoint would have been 
aborted/stopped before triggering
       the new one, while in reality, this previous checkpoint has never been 
triggered on this task
       so it also could not have been stopped.
   ```
   Second:
   ```
       This commit fixes a bug where RemoteInputChannel was incorrectly 
deciding which
       buffers should be spilled, if it has received an obsoleted 
CheckpointBarrier,
       that hasn't been cancelled (yet?).
   ```
   Both commits are tested by the existing UnalignedCheckpointITCase and some 
freshly added unit tests.
   
   ## Does this pull request potentially affect one of the following parts:
   
     - Dependencies (does it add or upgrade a dependency): (yes / **no**)
     - The public API, i.e., is any changed class annotated with 
`@Public(Evolving)`: (yes / **no**)
     - The serializers: (yes / **no** / don't know)
     - The runtime per-record code paths (performance sensitive): (yes / **no** 
/ don't know)
     - Anything that affects deployment or recovery: JobManager (and its 
components), Checkpointing, Kubernetes/Yarn/Mesos, ZooKeeper: (**yes** / no / 
don't know)
     - The S3 file system connector: (yes / **no** / don't know)
   
   ## Documentation
   
     - Does this pull request introduce a new feature? (yes / **no**)
     - If yes, how is the feature documented? (**not applicable** / docs / 
JavaDocs / not documented)
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to