[ 
https://issues.apache.org/jira/browse/FLINK-10287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16604505#comment-16604505
 ] 

Sayat Satybaldiyev commented on FLINK-10287:
--------------------------------------------

Might be related to https://issues.apache.org/jira/browse/FLINK-10286

> Flink HA Persist Cancelled Job in Zookeeper
> -------------------------------------------
>
>                 Key: FLINK-10287
>                 URL: https://issues.apache.org/jira/browse/FLINK-10287
>             Project: Flink
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 1.6.0
>            Reporter: Sayat Satybaldiyev
>            Priority: Major
>         Attachments: Screenshot from 2018-09-05 16-48-34.png
>
>
> Flink HA keeps a canceled job's metadata persisted in ZooKeeper, which makes
> HA mode quite fragile. If the JM is restarted, it tries to recover the canceled
> job and, after some time, fails completely because it cannot recover it.
>  
> How to reproduce:
>  # Have a Flink 1.6 HA cluster
>  # Cancel a running Flink job
>  # Observe that Flink did not remove the job's ZK metadata.
> !Screenshot from 2018-09-05 16-48-34.png!
> {code}
> ls /flink/flink_ns/jobgraphs/46d8d3555936c0d8e6b6ec21cc02bb11
> [7f392fd9-cedc-4978-9186-1f54b98eeeb7]{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
