[jira] [Commented] (FLINK-4816) Executions failed from "DEPLOYING" should retain restored checkpoint information

ramkrishna.s.vasudevan (JIRA) Tue, 28 Feb 2017 03:41:33 -0800

    [ 
https://issues.apache.org/jira/browse/FLINK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15887850#comment-15887850
 ]


ramkrishna.s.vasudevan commented on FLINK-4816:
-----------------------------------------------

Going through the code, will it better that in the Checkpointcoordinator when 
we assign the restore checkpoint state to the execution job vertices, can we 
just set the latest checkpoint id into the vertices?
So when we call fail() on the Exceution, and we find that the job vertex has a 
non negative checkpoint ID, we could wrap the throwable with 
RestoreTaskException along with the checkpoint id and if the job vertex has a 
non negative ID then wrap it with just DeployTaskException. 
Ping [~StephanEwen]?

> Executions failed from "DEPLOYING" should retain restored checkpoint 
> information
> --------------------------------------------------------------------------------
>
>                 Key: FLINK-4816
>                 URL: https://issues.apache.org/jira/browse/FLINK-4816
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Distributed Coordination
>            Reporter: Stephan Ewen
>
> When an execution fails from state {{DEPLOYING}}, it should wrap the failure 
> to better report the failure cause:
>   - If no checkpoint was restored, it should wrap the exception in a 
> {{DeployTaskException}}
>   - If a checkpoint was restored, it should wrap the exception in a 
> {{RestoreTaskException}} and record the id of the checkpoint that was 
> attempted to be restored.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

[jira] [Commented] (FLINK-4816) Executions failed from "DEPLOYING" should retain restored checkpoint information

Reply via email to