[ 
https://issues.apache.org/jira/browse/FLINK-10856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16685262#comment-16685262
 ] 

ASF GitHub Bot commented on FLINK-10856:
----------------------------------------

tillrohrmann opened a new pull request #7088: [FLINK-10856] Take latest 
checkpoint to resume from in resume from externalized checkpoint e2e test
URL: https://github.com/apache/flink/pull/7088
 
 
   ## What is the purpose of the change
   
   Since it can happen that some empty checkpoint directories are left, we have 
to take the latest
   checkpoint directory in order to resume from an externalized checkpoint. 
This commit changes the
   test_resume_externalized_checkpoint.sh to sort the checkpoint directories in 
descending order and
   then takes the head checkpoint directory.
   
   ## Verifying this change
   
   - Manually tested.
   
   ## Does this pull request potentially affect one of the following parts:
   
     - Dependencies (does it add or upgrade a dependency): (no)
     - The public API, i.e., is any changed class annotated with 
`@Public(Evolving)`: (no)
     - The serializers: (no)
     - The runtime per-record code paths (performance sensitive): (no)
     - Anything that affects deployment or recovery: JobManager (and its 
components), Checkpointing, Yarn/Mesos, ZooKeeper: (no)
     - The S3 file system connector: (no)
   
   ## Documentation
   
     - Does this pull request introduce a new feature? (no)
     - If yes, how is the feature documented? (not applicable)
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Harden resume from externalized checkpoint E2E test
> ---------------------------------------------------
>
>                 Key: FLINK-10856
>                 URL: https://issues.apache.org/jira/browse/FLINK-10856
>             Project: Flink
>          Issue Type: Bug
>          Components: E2E Tests, State Backends, Checkpointing
>    Affects Versions: 1.5.5, 1.6.2, 1.7.0
>            Reporter: Till Rohrmann
>            Assignee: Till Rohrmann
>            Priority: Critical
>              Labels: pull-request-available
>             Fix For: 1.5.6, 1.6.3, 1.7.0
>
>
> The resume from externalized checkpoints E2E test can fail due to 
> FLINK-10855. We should harden the test script to not expect a single 
> checkpoint directory being present but to take the checkpoint with the 
> highest checkpoint counter.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to